GPT-5.4 Pro | OpenAI | $30 | $180 | Top-tier reasoning model, Mar 2026 |
GPT-5.4 | OpenAI | $2.5 | $15 | Flagship, Feb 2026 |
GPT-5.4 Mini | OpenAI | $0.75 | $4.5 | Mid-tier 5.4 variant, Mar 2026 |
GPT-5.4 Nano | OpenAI | $0.2 | $1.25 | Efficient 5.4 variant, Mar 2026 |
GPT-5.3 | OpenAI | $1.75 | $14 | Chat-optimized, Mar 2026 |
GPT-5.2 Pro | OpenAI | $10.5 | $84 | Reasoning variant, 400K context (price halved Mar 2026) |
GPT-5.2 | OpenAI | $0.875 | $7 | Dec 2025 flagship (price halved Mar 2026) |
GPT-5.1 | OpenAI | $0.625 | $5 | Coding-optimized (price halved Mar 2026) |
GPT-5 | OpenAI | $1.25 | $10 | Aug 2025 flagship, 400K context |
GPT-5 Mini | OpenAI | $0.25 | $2 | Efficient mid-tier, great value |
GPT-5 Nano | OpenAI | $0.05 | $0.4 | Cheapest OpenAI option |
GPT-4.1 | OpenAI | $2 | $8 | Strong all-rounder, 1M context |
GPT-4.1 Mini | OpenAI | $0.4 | $1.6 | Efficient mid-tier, 1M context |
GPT-4.1 Nano | OpenAI | $0.1 | $0.4 | Fastest & cheapest GPT-4.1 |
o3 | OpenAI | $2 | $8 | Reasoning model, price dropped Mar 2026 |
o3-mini | OpenAI | $1.1 | $4.4 | Affordable reasoning model |
o4-mini | OpenAI | $1.1 | $4.4 | Affordable reasoning model |
o1 | OpenAI | $15 | $60 | Legacy reasoning, high-cost |
Claude Opus 4.6 | Anthropic | $5 | $25 | Most capable Anthropic model |
Claude Sonnet 4.6 | Anthropic | $3 | $15 | Opus-level performance at Sonnet pricing, 1M context |
Claude Haiku 4.5 | Anthropic | $1 | $5 | Fast & efficient, great for routing |
Claude Opus 4.5 | Anthropic | $5 | $25 | Previous flagship, same pricing as 4.6 |
Claude Sonnet 4.5 | Anthropic | $3 | $15 | Previous Sonnet, same pricing as 4.6 |
Claude Sonnet 4 | Anthropic | $3 | $15 | Previous generation Sonnet |
Claude Haiku 3 | Anthropic | $0.25 | $1.25 | Retiring Apr 2026 |
Gemini 3.1 Pro | Google | $2 | $12 | Latest Google flagship, Mar 2026 |
Gemini 3 Flash | Google | $0.5 | $3 | Pro-grade reasoning at Flash speed |
Gemini 3.1 Flash Lite | Google | $0.25 | $1.5 | Cost-efficient 3.1 variant, Mar 2026 |
Gemini 2.5 Pro | Google | $1.25 | $10 | Production-ready, 1M context |
Gemini 2.5 Flash | Google | $0.3 | $2.5 | Capable budget option (repriced Mar 2026) |
Gemini 2.5 Flash-Lite | Google | $0.1 | $0.4 | Cost-efficient, now GA |
Gemini 2.0 Flash-Lite | Google | $0.075 | $0.3 | Cheapest Google model, retiring Jun 2026 |
Grok 4.20 | xAI | $2 | $6 | New flagship, 2M context, Mar 2026 |
Grok 4.1 Fast | xAI | $0.2 | $0.5 | 2M context, very competitive pricing |
Mistral Large 3 | Mistral | $0.5 | $1.5 | 675B params, via Mistral API |
Mistral Medium 3 | Mistral | $0.4 | $2 | Enterprise-grade, 131K context. Via OpenRouter. |
Mistral Small 4 | Mistral | $0.15 | $0.6 | Hybrid reasoning, multimodal, 262K context. Via OpenRouter. |
DeepSeek V3.2 | DeepSeek | $0.28 | $0.42 | Cost-effective API, strong coding/math |
Qwen3 Max | Alibaba | $0.78 | $3.9 | Qwen flagship, 262K context. Via OpenRouter. |
Qwen3.5 Plus | Alibaba | $0.26 | $1.56 | Qwen mid-tier. Via OpenRouter. |
Qwen3.5 397B A17B | Alibaba | $0.39 | $2.34 | Open-weights 397B MoE (17B active), vision-language. Via OpenRouter. |
Qwen3 235B A22B | Alibaba | $0.455 | $1.82 | Open-weights 235B MoE (22B active), Instruct. Via OpenRouter. |
Kimi K2.5 | Moonshot | $0.42 | $2.2 | Strong coding & math, 262K context. Via OpenRouter. |
MiniMax M2.7 | MiniMax | $0.3 | $1.2 | Latest MiniMax flagship. Via OpenRouter. |
MiniMax M2.5 | MiniMax | $0.2 | $1.17 | Previous MiniMax flagship. Via OpenRouter. |
MiniMax M2-Her | MiniMax | $0.3 | $1.2 | 65K context. Via OpenRouter. |
Llama 4 Maverick | Meta | $0.15 | $0.6 | Open-weights 400B MoE (17B active). Via OpenRouter. |
Llama 4 Scout | Meta | $0.08 | $0.3 | Open-weights, efficient Llama 4 variant. Via OpenRouter. |
Arcee Trinity Nano | Arcee AI | N/A (self-hosted) | N/A (self-hosted) | Open-weights 6B MoE (1B active), 128K context. Self-hosted only. |