Models
This page is generated from the model catalog during the docs build.
Total active models listed: 62.
LLM (Text & Chat)
Endpoint: POST /v1/chat/completions
| Model ID | Display name | Vendor | Pricing | Context | Capabilities |
|---|---|---|---|---|---|
| qwen3-coder-plus | Qwen 3 Coder Plus | Alibaba | $0.8 in / $2 out per 1M tokens | 131K | chat, tool_use, reasoning |
| qwen3-max | Qwen 3 Max | Alibaba | $1.6 in / $6.4 out per 1M tokens | 262K | chat, tool_use, vision, reasoning |
| qwen3-vl-plus | Qwen 3 VL Plus | Alibaba | $0.8 in / $2 out per 1M tokens | 131K | chat, vision, tool_use |
| qwen3.5-flash | Qwen 3.5 Flash | Alibaba | $0.05 in / $0.1 out per 1M tokens | 1M | chat, tool_use, vision |
| qwen-3.5-cloud | Qwen 3.5 Plus | Alibaba | $0.3 in / $0.6 out per 1M tokens | 131K | chat, tool_use, vision, reasoning |
| qwen3.6-plus | Qwen 3.6 Plus | Alibaba | $0.8 in / $2 out per 1M tokens | 1M | chat, tool_use, vision, reasoning |
| qwen-turbo | Qwen Turbo | Alibaba | $0.05 in / $0.1 out per 1M tokens | 131K | chat, tool_use |
| qwen-vl-max | Qwen VL Max | Alibaba | $2.4 in / $6.4 out per 1M tokens | 131K | chat, vision, tool_use |
| claude-haiku | Claude Haiku 4.5 | Anthropic | $0.8 in / $4 out per 1M tokens | 200K | chat, tool_use, vision |
| claude-opus | Claude Opus 4.6 | Anthropic | $15 in / $75 out per 1M tokens | 1M | chat, tool_use, vision, reasoning |
| claude-sonnet-4 | Claude Sonnet 4 | Anthropic | $3 in / $15 out per 1M tokens | 200K | chat |
| claude-sonnet | Claude Sonnet 4.6 | Anthropic | $3 in / $15 out per 1M tokens | 1M | chat, tool_use, vision, reasoning |
| deepseek-v3.2 | DeepSeek V3.2 | DeepSeek | $0.27 in / $1.1 out per 1M tokens | 131K | chat, reasoning |
| deepseek-v3.2-azure | DeepSeek V3.2 | DeepSeek | $0.58 in / $1.68 out per 1M tokens | 131K | chat, tool_use, code |
| gemini-3-flash | Gemini 3 Flash | Google | $0.15 in / $0.6 out per 1M tokens | 1M | chat, tool_use, vision |
| gemma4-26b | Gemma 4 26B | Google | $0.07 in / $0.15 out per 1M tokens | 262K | chat, vision, video, tool_use |
| gemma4-31b | Gemma 4 31B | Google | $0.15 in / $0.4 out per 1M tokens | 262K | chat, vision, video, tool_use, reasoning |
| llama-4-maverick | Llama 4 Maverick | Meta | $0.25 in / $1 out per 1M tokens | 131K | chat, vision |
| llama-4-scout | Llama 4 Scout | Meta | $0.17 in / $0.17 out per 1M tokens | 524K | chat, tool_use, vision |
| mistral-large-3 | Mistral Large 3 | Mistral | $0.5 in / $1.5 out per 1M tokens | 131K | chat, tool_use, code |
| mistral-small-azure | Mistral Small | Mistral | $0.3 in / $1.2 out per 1M tokens | 131K | chat, tool_use |
| kimi-k2.5 | Kimi K2.5 | Moonshot AI | $0.77 in / $3.44 out per 1M tokens | 262K | chat, tool_use, vision, reasoning |
| kimi-k2.5-bedrock | Kimi K2.5 (Bedrock) | Moonshot AI | $0.6 in / $2.5 out per 1M tokens | 256K | chat, reasoning |
| nemotron-3-super-120b | Nemotron 3 Super 120B | NVIDIA | $1.3 in / $2 out per 1M tokens | 131K | chat, reasoning, code, tool_use |
| gpt-5.3 | GPT-5.3 | OpenAI | $2.5 in / $10 out per 1M tokens | 131K | chat, tool_use, vision, reasoning |
| gpt-5.4 | GPT-5.4 | OpenAI | $2.5 in / $15 out per 1M tokens | 131K | chat, tool_use, vision, reasoning |
| gpt-5.4-mini | GPT-5.4 Mini | OpenAI | $0.75 in / $4.5 out per 1M tokens | 1M | chat, tool_use, vision |
| gpt-5.4-nano | GPT-5.4 Nano | OpenAI | $0.2 in / $1.25 out per 1M tokens | 131K | chat, tool_use |
| grok-4.1-fast | Grok 4.1 Fast | xAI | $0.4 in / $1 out per 1M tokens | 131K | chat, tool_use |
| grok-4.1-reasoning | Grok 4.1 Reasoning | xAI | $0.4 in / $1 out per 1M tokens | 131K | chat, tool_use, reasoning |
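
The Pricing column quotes separate input and output rates per 1M tokens, so a per-request estimate is a weighted sum of the two. The following is a minimal sketch of that arithmetic; `parse_pricing` and `estimate_cost` are hypothetical helper names, and the sketch assumes the cell text follows the `$X in / $Y out per 1M tokens` pattern used in the table. Cells that read "See pricing" carry no rates and are rejected.

```python
import re

# Matches pricing cells such as "$0.8 in / $2 out per 1M tokens".
_PRICING_RE = re.compile(
    r"\$(?P<inp>[\d.]+) in / \$(?P<out>[\d.]+) out per 1M tokens"
)


def parse_pricing(cell: str) -> tuple[float, float]:
    """Return (input_rate, output_rate) in USD per 1M tokens."""
    match = _PRICING_RE.fullmatch(cell.strip())
    if match is None:
        # Covers cells like "See pricing" that carry no numeric rates.
        raise ValueError(f"no per-token rates in pricing cell: {cell!r}")
    return float(match["inp"]), float(match["out"])


def estimate_cost(cell: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from a Pricing cell."""
    in_rate, out_rate = parse_pricing(cell)
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


if __name__ == "__main__":
    # claude-sonnet row: $3 in / $15 out per 1M tokens.
    cost = estimate_cost("$3 in / $15 out per 1M tokens", 10_000, 2_000)
    print(f"${cost:.4f}")  # prints $0.0600
```

The estimate ignores anything the table does not list, such as caching discounts or minimum charges, so treat it as a rough upper-level figure rather than a billing calculation.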
Image Generation
Endpoint: POST /v1/images/generations
| Model ID | Display name | Vendor | Pricing | Context | Capabilities |
|---|---|---|---|---|---|
| wan2.7-image | Wan 2.7 Image | Alibaba | $0.02 in / $0 out per 1M tokens | - | image_generation |
| wan2.7-image-pro | Wan 2.7 Image Pro | Alibaba | $0.04 in / $0 out per 1M tokens | - | image_generation |
| flux-schnell | FLUX Schnell | Black Forest Labs | $0.003 in / $0 out per 1M tokens | - | image_generation |
| flux2-flex | FLUX.2 Flex | Black Forest Labs | See pricing | - | image_generation, image_edit |
| flux2-pro | FLUX.2 Pro | Black Forest Labs | See pricing | - | image_generation, image_edit |
| gemini-image | Gemini 3 Pro Image | Google | $0.15 in / $0.6 out per 1M tokens | 1M | image_generation, vision |
| gemini-3-flash-image | Gemini 3.1 Flash Image | Google | $0.15 in / $0.6 out per 1M tokens | 1M | chat, vision, image_generation |
| imagen-4 | Imagen 4 | Google | See pricing | - | image_generation |
| gpt-image-1.5 | GPT Image 1.5 | OpenAI | $5 in / $40 out per 1M tokens | - | image_generation, image_edit |
| recraft-v4 | Recraft V4 Pro | Recraft AI | See pricing | - | image_generation |
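
The Capabilities column can be used to shortlist models, for example those that list `image_edit`. The sketch below parses a few rows copied from the table above into records and filters them. The `Model` dataclass and the `parse_row` and `with_capability` helpers are hypothetical names chosen for illustration, not part of any SDK, and the parser assumes each row has exactly six pipe-separated cells.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    model_id: str
    display_name: str
    vendor: str
    pricing: str
    context: str
    capabilities: tuple[str, ...]


def parse_row(row: str) -> Model:
    """Parse one six-column table row into a Model record."""
    cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
    if len(cells) != 6:
        raise ValueError(f"expected 6 cells, got {len(cells)}: {row!r}")
    model_id, display_name, vendor, pricing, context, caps = cells
    capabilities = tuple(c.strip() for c in caps.split(",") if c.strip())
    return Model(model_id, display_name, vendor, pricing, context, capabilities)


def with_capability(models: list[Model], capability: str) -> list[str]:
    """Return the IDs of models that list the given capability."""
    return [m.model_id for m in models if capability in m.capabilities]


# A few rows copied from the Image Generation table.
SAMPLE_ROWS = """
| flux-schnell | FLUX Schnell | Black Forest Labs | $0.003 in / $0 out per 1M tokens | - | image_generation |
| flux2-flex | FLUX.2 Flex | Black Forest Labs | See pricing | - | image_generation, image_edit |
| gpt-image-1.5 | GPT Image 1.5 | OpenAI | $5 in / $40 out per 1M tokens | - | image_generation, image_edit |
| recraft-v4 | Recraft V4 Pro | Recraft AI | See pricing | - | image_generation |
"""

if __name__ == "__main__":
    models = [parse_row(line) for line in SAMPLE_ROWS.strip().splitlines()]
    print(with_capability(models, "image_edit"))
    # prints ['flux2-flex', 'gpt-image-1.5']
```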
Video Generation
Endpoint: POST /v1/videos/generations
| Model ID | Display name | Vendor | Pricing | Context | Capabilities |
|---|---|---|---|---|---|
| wan-2.6-i2v | Wan 2.6 Image to Video | Alibaba | See pricing | - | video_generation, image_to_video |
| wan-2.6-t2v | Wan 2.6 Text to Video | Alibaba | See pricing | - | video_generation, text_to_video |
| seedance-1.5-i2v | Seedance 1.5 Pro Image to Video | ByteDance | See pricing | - | video_generation, image_to_video |
| seedance-1.5-t2v | Seedance 1.5 Pro Text to Video | ByteDance | See pricing | - | video_generation |
| seedance-2-i2v | Seedance 2.0 Image to Video | ByteDance | See pricing | 131K | video |
| seedance-2-t2v | Seedance 2.0 Text to Video | ByteDance | See pricing | 131K | video |
| veo-3.1-i2v | Veo 3.1 Image to Video | Google | See pricing | - | video_generation, image_to_video |
| veo-3.1-t2v | Veo 3.1 Text to Video | Google | See pricing | - | video_generation |
| kling-3-i2v | Kling 3 Image to Video | Kuaishou | See pricing | - | video_generation, image_to_video |
| kling-3-t2v | Kling 3 Text to Video | Kuaishou | See pricing | - | video_generation, text_to_video |
Speech-to-Text
Endpoint: POST /v1/audio/transcriptions
| Model ID | Display name | Vendor | Pricing | Context | Capabilities |
|---|---|---|---|---|---|
| faster-whisper-v3 | Faster-Whisper V3 | Community | See pricing | - | stt, turkish |
| whisper-large-v3 | Whisper Large V3 | OpenAI | See pricing | - | stt, turkish |
| whisper-large-v3-turbo | Whisper V3 Turbo | OpenAI | $0.006 in / $0 out per 1M tokens | - | stt, turkish |
Text-to-Speech
Endpoint: POST /v1/audio/speech
| Model ID | Display name | Vendor | Pricing | Context | Capabilities |
|---|---|---|---|---|---|
| elevenlabs-multilingual | ElevenLabs Turbo v3 | ElevenLabs | See pricing | - | tts, turkish, voice_clone, multilingual |
| azure-tts | Azure TTS | Microsoft | $0.6 in / $12 out per 1M tokens | - | text_to_speech |
| chatterbox-multilingual | Chatterbox Multilingual | Resemble AI | See pricing | - | tts, turkish, voice_clone |
Embeddings
Endpoint: POST /v1/embeddings
| Model ID | Display name | Vendor | Pricing | Context | Capabilities |
|---|---|---|---|---|---|
| qwen-embedding-v3 | Qwen Embedding v3 | Alibaba | $0.007 in / $0 out per 1M tokens | 8K | embedding |
| titan-embed-v2 | Amazon Titan Embed v2 | Amazon | $0.003 in / $0 out per 1M tokens | 8K | embedding |
| gemini-embedding-2 | Gemini Embedding 2 | Google | $0.015 in / $0 out per 1M tokens | 8K | embedding |
| text-embedding-3-large | Text Embedding 3 Large | OpenAI | $0.13 in / $0 out per 1M tokens | 8K | embeddings |
| text-embedding-3-small | Text Embedding 3 Small | OpenAI | $0.02 in / $0 out per 1M tokens | 8K | embeddings |
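
The Context column uses rounded labels such as `8K`, `131K`, and `1M`. The page does not say whether `K` means 1,000 or 1,024 tokens, so the sketch below assumes the smaller multiplier to stay on the conservative side when checking whether an input is likely to fit. `context_tokens` and `fits_context` are hypothetical helper names.

```python
from typing import Optional

# Assumption: decimal multipliers, chosen because they never overstate the
# limit even if the catalog's labels are rounded from powers of two.
_SUFFIXES = {"K": 1_000, "M": 1_000_000}


def context_tokens(label: str) -> Optional[int]:
    """Convert a Context label such as "8K" or "1M" to a token count.

    Returns None for "-", which the tables use when no context is listed.
    """
    label = label.strip()
    if not label or label == "-":
        return None
    suffix = label[-1].upper()
    if suffix not in _SUFFIXES:
        raise ValueError(f"unrecognized context label: {label!r}")
    return int(float(label[:-1]) * _SUFFIXES[suffix])


def fits_context(label: str, token_count: int) -> bool:
    """Check a token count against the limit implied by a Context label."""
    limit = context_tokens(label)
    if limit is None:
        raise ValueError("no context window listed for this model")
    return token_count <= limit


if __name__ == "__main__":
    print(context_tokens("8K"))       # prints 8000
    print(fits_context("8K", 9_000))  # prints False
```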
Music Generation
Endpoint: POST /v1/audio/music
| Model ID | Display name | Vendor | Pricing | Context | Capabilities |
|---|---|---|---|---|---|
| elevenlabs-music | ElevenLabs Music | ElevenLabs | See pricing | 131K | music_generation |
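
Each category on this page maps to one endpoint path. The sketch below collects those paths in a lookup table and builds (without sending) a POST request for a given category. Only the paths come from this page: the base URL `https://api.example.com`, the Bearer-token header, the JSON request body, and the `model` and `input` field names are all assumptions for illustration. Transcription endpoints in particular often expect a multipart file upload rather than JSON, so the `stt` entry is included for its path only.

```python
import json
import urllib.request

# Endpoint paths as listed under each category heading on this page.
ENDPOINTS = {
    "chat": "/v1/chat/completions",
    "image": "/v1/images/generations",
    "video": "/v1/videos/generations",
    "stt": "/v1/audio/transcriptions",
    "tts": "/v1/audio/speech",
    "embeddings": "/v1/embeddings",
    "music": "/v1/audio/music",
}

# Hypothetical base URL; replace with the real API host.
BASE_URL = "https://api.example.com"


def build_request(category: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build a JSON POST request for a category without sending it."""
    if category not in ENDPOINTS:
        raise KeyError(f"unknown category: {category!r}")
    return urllib.request.Request(
        BASE_URL + ENDPOINTS[category],
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: Bearer-token auth; this page does not specify it.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Field names "model" and "input" are assumed, not confirmed by this page.
    req = build_request(
        "embeddings",
        {"model": "qwen-embedding-v3", "input": "hello"},
        api_key="YOUR_API_KEY",
    )
    print(req.get_method(), req.full_url)
    # prints POST https://api.example.com/v1/embeddings
```

Passing the returned object to `urllib.request.urlopen` would send it; the sketch stops short of that so it can be run without credentials.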