Review list pricing by model group, compare input and output rates, and align teams on how model choice affects cost, context, and infrastructure posture.
Granular usage feedback and budgeting controls empower users to manage their spend.
Compare price with context window, infrastructure type, and model family in one place. Supplier errors are captured automatically and requests are redirected, which improves uptime and reduces disruption to agentic flows.
Role-based access, GDPR and ISO compliance, and multi-provider support for resilience.
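The automatic error capture and redirect described above can be pictured with a minimal sketch. It assumes a simple "try suppliers in order" policy; `SupplierError`, `call_with_failover`, and the two demo suppliers are hypothetical names used only for illustration and are not part of any documented API.

```python
from typing import Callable, Sequence


class SupplierError(Exception):
    """Hypothetical error raised when an upstream supplier fails."""


def call_with_failover(prompt: str, suppliers: Sequence[Callable[[str], str]]) -> str:
    """Try each supplier in order and return the first successful reply."""
    errors = []
    for supplier in suppliers:
        try:
            return supplier(prompt)
        except SupplierError as exc:
            # Capture the failure and redirect the request to the next supplier.
            errors.append(exc)
    raise SupplierError(f"all {len(errors)} suppliers failed")


# Hypothetical suppliers for demonstration only.
def flaky_primary(prompt: str) -> str:
    raise SupplierError("primary unavailable")


def healthy_backup(prompt: str) -> str:
    return f"backup reply to: {prompt}"


reply = call_with_failover("hello", [flaky_primary, healthy_backup])
```

Here the caller receives the backup's reply without seeing the primary's failure, which is the behavior that keeps a multi-step agentic flow from stopping on a single supplier error.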
Public list pricing for chat models.

| Model | Family | Context | Infrastructure | Input / 1M | Output / 1M | Regions | Status |
|---|---|---|---|---|---|---|---|
| Claude Opus 4.6 (`claude-opus-4.6`) | chat | 200K | cloud | $15.00 | $75.00 | US (Anthropic) | Live |
| GPT-5.2 Codex (`gpt-5.2-codex`) | chat | 256K | cloud | $2.50 | $10.00 | US (OpenAI) | Live |
| GPT-5.3 Codex (`gpt-5.3-codex`) | chat | 400K | cloud | $1.75 | $14.00 | US (OpenAI) | Live |
| Claude Sonnet 4.6 (`claude-sonnet-4.6`) | chat | 200K | cloud | $3.00 | $15.00 | US (Anthropic) | Live |
| GLM-5 (`glm-5`) | chat | 202K | cloud | $0.80 | $2.56 | US/EU (DeepInfra) | Live |
| MiniMax M2.5 (`minimax-m2`) | chat | 192K | cloud | $0.30 | $1.20 | Singapore (MiniMax) | Live |
| Qwen 2.5 72B (`qwen-72b`) | chat | 32K | cloud | $0.12 | $0.39 | US/EU (DeepInfra) | Live |
| Qwen3 30B-A3B (`qwen-coder-32b`) | chat | 40K | cloud | $0.08 | $0.28 | US/EU (DeepInfra) | Live |
| GLM-4.7 Flash (`glm-4-flash`) | chat | 202K | cloud | $0.06 | $0.40 | US/EU (DeepInfra) | Live |
| Kimi K2 (`kimi-k2`) | chat | 131K | cloud | $0.20 | $0.40 | US (Groq) | Live |
| DeepSeek R1 7B (`deepseek-r1-7b`) | chat | 32K | local | $0.20 | $0.20 | EU | Live |
| Mistral Medium (`mistral-medium`) | chat | 128K | cloud | $0.00 | $0.00 | Mistral AI | Live |

Public list pricing for vision models.
| Model | Family | Context | Infrastructure | Input / 1M | Output / 1M | Regions | Status |
|---|---|---|---|---|---|---|---|
| Qwen3 VL 30B (`qwen3-vl`) | vision | 262K | cloud | $0.15 | $0.60 | US/EU (DeepInfra) | Live |

Public list pricing for embedding models.
| Model | Family | Context | Infrastructure | Input / 1M | Output / 1M | Regions | Status |
|---|---|---|---|---|---|---|---|
| BGE-M3 (`bge-m3`) | embedding | 8K | local | $0.02 | $0.00 | EU | Live |

Public list pricing for reranker models.
| Model | Family | Context | Infrastructure | Input / 1M | Output / 1M | Regions | Status |
|---|---|---|---|---|---|---|---|
| BGE Reranker v2-M3 (`bge-reranker-v2-m3`) | reranker | 8K | local | $0.02 | $0.00 | EU | Live |

Public list pricing for TTS models.
| Model | Family | Context | Infrastructure | Input / 1M | Output / 1M | Regions | Status |
|---|---|---|---|---|---|---|---|
| Kokoro TTS (`kokoro`) | tts | - | local | $15.00 | $0.00 | EU | Live |
| F5-TTS (`f5-tts`) | tts | - | local | $30.00 | $0.00 | EU | Live |

Public list pricing for transcription models.
| Model | Family | Context | Infrastructure | Input / 1M | Output / 1M | Regions | Status |
|---|---|---|---|---|---|---|---|
| Whisper Large v3 Turbo (`whisper`) | transcription | - | local | $6.00 | $0.00 | EU | Live |

Public list pricing for speaker models.
| Model | Family | Context | Infrastructure | Input / 1M | Output / 1M | Regions | Status |
|---|---|---|---|---|---|---|---|
| ECAPA-TDNN (`ecapa-tdnn`) | speaker | - | local | $0.10 | $0.00 | EU | Live |
| x-vector (`xvector`) | speaker | - | local | $0.10 | $0.00 | EU | Live |
| CAM++ (`cam++`) | speaker | - | local | $0.15 | $0.00 | EU | Live |
| ResNet293 (`resnet293`) | speaker | - | local | $0.15 | $0.00 | EU | Live |
| WavLM+ECAPA (`wavlm-base-plus-sv`) | speaker | - | local | $0.20 | $0.00 | EU | Live |

Public list pricing for audio models.
| Model | Family | Context | Infrastructure | Input / 1M | Output / 1M | Regions | Status |
|---|---|---|---|---|---|---|---|
| CLAP (`clap`) | audio | - | local | $0.05 | $0.00 | EU | Live |
| Audio Spectrogram Transformer (`ast`) | audio | - | local | $0.05 | $0.00 | EU | Live |
| MERT-330M (`mert`) | audio | - | local | $0.10 | $0.00 | EU | Live |

Use the pricing tables with the public model catalog and API documentation to decide what your team should test, approve, and scale.
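When comparing models, it can help to turn the input and output list rates into a per-request estimate. The sketch below copies three rows from the chat pricing table and assumes the "/ 1M" columns mean USD per one million tokens, which the tables do not state explicitly. `PRICES` and `estimate_cost` are hypothetical names for illustration, and actual billing may differ from list price.

```python
from decimal import Decimal

# List prices (input, output) per 1M units, copied from the chat table.
# Assumption: the unit is tokens and the currency is USD.
PRICES = {
    "claude-sonnet-4.6": (Decimal("3.00"), Decimal("15.00")),
    "gpt-5.3-codex": (Decimal("1.75"), Decimal("14.00")),
    "kimi-k2": (Decimal("0.20"), Decimal("0.40")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the list-price cost of one request for the given model."""
    input_rate, output_rate = PRICES[model_id]
    return (input_tokens * input_rate + output_tokens * output_rate) / MILLION


# Compare the same workload (10,000 input and 2,000 output tokens) across models.
for model_id in PRICES:
    print(model_id, estimate_cost(model_id, 10_000, 2_000))
```

For this workload the estimate is $0.06 on `claude-sonnet-4.6`, $0.0455 on `gpt-5.3-codex`, and $0.0028 on `kimi-k2`. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when summed over many requests.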