Enabled when NOVITA_API_KEY is set in your environment.
Overview
The @standardagents/novita package provides a Novita AI provider factory for Standard Agents. It targets Novita’s OpenAI-compatible Chat Completions API and adds typed providerOptions, authenticated model discovery, request inspection, tool calling, streaming, structured outputs, and live catalog pricing.
Key Features
- Direct integration with https://api.novita.ai/openai/v1/chat/completions
- Live model discovery from https://api.novita.ai/openai/v1/models
- Tool calling, structured outputs, streaming, and reasoning controls
- Catalog price normalization into inputPrice, outputPrice, cachedPrice, and usage.pricing
Installation
npm
pnpm
yarn
Quick Start
agents/models/novita_reasoning.ts
Provider Options
Available Options
| Option | Type | Description |
|---|---|---|
| seed | number | Best-effort deterministic sampling seed |
| frequency_penalty | number | Penalize repeated tokens based on frequency |
| presence_penalty | number | Penalize repeated tokens based on presence |
| repetition_penalty | number | Novita repetition penalty |
| top_k | number | Top-k sampling limit |
| min_p | number | Minimum probability threshold relative to the most likely token |
| logit_bias | Record<string, number> | Token logit biases |
| logprobs | boolean | Return output token log probabilities |
| top_logprobs | number | Number of top logprobs to return |
| separate_reasoning | boolean | Request reasoning content separately when supported |
| enable_thinking | boolean | Enable or disable model thinking when supported |
| modalities | string[] | Output modalities for models that can return audio |
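As a rough illustration of how these options relate to the request, the sketch below merges them into a Chat Completions body. The helper name buildChatBody and the top-level placement of each option are assumptions made for illustration; the provider's actual request construction may differ.

```typescript
// Option names mirror the table above; all are optional.
interface NovitaProviderOptions {
  seed?: number;
  frequency_penalty?: number;
  presence_penalty?: number;
  repetition_penalty?: number;
  top_k?: number;
  min_p?: number;
  logit_bias?: Record<string, number>;
  logprobs?: boolean;
  top_logprobs?: number;
  separate_reasoning?: boolean;
  enable_thinking?: boolean;
  modalities?: string[];
}

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

// Hypothetical helper (not part of the package): copies every defined
// option onto the top level of an OpenAI-compatible request body.
function buildChatBody(
  model: string,
  messages: ChatMessage[],
  options: NovitaProviderOptions = {}
): Record<string, unknown> {
  const body: Record<string, unknown> = { model, messages };
  const keys = Object.keys(options) as (keyof NovitaProviderOptions)[];
  for (const key of keys) {
    const value = options[key];
    // Skip unset options so they never appear in the serialized body.
    if (value !== undefined) {
      body[key] = value;
    }
  }
  return body;
}
```

Calling buildChatBody with { top_k: 40, enable_thinking: false } would yield a body that carries model, messages, and only those two extra fields.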
Model IDs
Novita model IDs are sent directly, for example:
- zai-org/glm-5.2
- moonshotai/kimi-k2.7-code
- minimax/minimax-m3
- deepseek/deepseek-v4-pro
- deepseek/deepseek-v4-flash
- deepseek/deepseek-v3.2
The provider discovers available models from https://api.novita.ai/openai/v1/models and exposes those models in the Standard Agents UI.
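Authenticated model discovery can be sketched as a request builder plus a response parser. The helper names are hypothetical, and both the Bearer authorization header and the { data: [{ id }] } response shape are assumed from the OpenAI-compatible convention rather than taken from the package source.

```typescript
// Assumed minimal shape of an OpenAI-compatible model list response.
interface ModelListResponse {
  data: Array<{ id: string }>;
}

// Hypothetical helper: builds the URL and headers for the catalog request.
function buildModelsRequest(
  apiKey: string,
  baseUrl: string = "https://api.novita.ai/openai/v1"
): { url: string; headers: Record<string, string> } {
  return {
    url: `${baseUrl}/models`,
    // The catalog requires authentication, so the key is always sent.
    headers: { Authorization: `Bearer ${apiKey}` },
  };
}

// Hypothetical helper: extracts the model IDs from a parsed response.
function parseModelIds(payload: ModelListResponse): string[] {
  return payload.data.map((model) => model.id);
}
```

In practice the url and headers would be passed to fetch, and the parsed JSON handed to parseModelIds.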
Novita reports catalog prices as scaled integers. The provider normalizes those values to USD per 1M tokens before returning inputPrice, outputPrice, cachedPrice, and per-request usage.pricing.
When Novita includes pricing.input_cache_read.price_per_m, the provider maps it to usage.pricing.cachedInputPerMillion and uses it for prompt_tokens_details.cached_tokens. Cache write tokens remain normal prompt input tokens and are priced at the standard input rate.
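The cached-token pricing rule above can be worked through as a small cost estimate. This is a sketch under stated assumptions: the field names inputPerMillion and outputPerMillion are hypothetical (only cachedInputPerMillion appears in this page), and cached tokens are assumed to be a subset of prompt_tokens, as in the OpenAI usage convention.

```typescript
// Prices in USD per 1M tokens. Only cachedInputPerMillion is named in
// this page; the other two field names are assumptions.
interface PricingPerMillion {
  inputPerMillion: number;
  outputPerMillion: number;
  cachedInputPerMillion?: number;
}

// Usage fields follow the OpenAI-compatible Chat Completions shape.
interface Usage {
  prompt_tokens: number;
  completion_tokens: number;
  prompt_tokens_details?: { cached_tokens?: number };
}

// Hypothetical helper: estimates the USD cost of one request.
function estimateCostUsd(usage: Usage, pricing: PricingPerMillion): number {
  // Cached tokens are assumed to be counted inside prompt_tokens.
  const reported = usage.prompt_tokens_details?.cached_tokens ?? 0;
  const cached = Math.min(reported, usage.prompt_tokens);
  const uncached = usage.prompt_tokens - cached;
  // Without a cache-read price, cached tokens fall back to the input rate.
  const cachedRate = pricing.cachedInputPerMillion ?? pricing.inputPerMillion;
  const total =
    uncached * pricing.inputPerMillion +
    cached * cachedRate +
    usage.completion_tokens * pricing.outputPerMillion;
  return total / 1_000_000;
}
```

Cache write tokens need no special handling here, since they stay in the uncached portion and are billed at the standard input rate.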
Custom or private Novita models should still set inputPrice, outputPrice, and cachedPrice explicitly if they do not appear in Novita’s model catalog.
Environment Setup
.dev.vars
Notes
- The provider uses Chat Completions, not OpenAI Responses.
- Novita’s model catalog requires authentication.
- reasoning.level automatically enables separate_reasoning and enable_thinking; set provider options directly when you need model-specific control.
- For custom base URLs, configure the provider baseUrl slot with the full OpenAI-compatible base, such as https://api.novita.ai/openai/v1.
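To show why the base URL should be the full OpenAI-compatible base, the sketch below joins a base and an endpoint path. The helper resolveEndpoint is hypothetical and only illustrates the expected relationship between the base and the endpoints listed under Key Features.

```typescript
// Hypothetical helper: joins a base URL and an endpoint path with
// exactly one slash between them.
function resolveEndpoint(baseUrl: string, path: string): string {
  const base = baseUrl.replace(/\/+$/, ""); // drop trailing slashes
  const suffix = path.replace(/^\/+/, ""); // drop leading slashes
  return `${base}/${suffix}`;
}
```

With the default base, "chat/completions" resolves to https://api.novita.ai/openai/v1/chat/completions and "models" resolves to https://api.novita.ai/openai/v1/models.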
Related
Models
Core model concepts
Cerebras Provider
Compare another Chat Completions provider
OpenRouter Provider
Compare with a multi-provider gateway