Cheapest AI API in 2026
The cheapest AI model with strong Nordic-language quality right now is Cohere: Command R7B (12-2024) at $0.04/1M tokens. This pick is based on 30 days of daily evaluation across more than 350 models.
| Model | Provider | Context | Price/1M tokens | Price/1,000 tokens | Cost for 100 documents |
|---|---|---|---|---|---|
| OpenRouter: Fusion | openrouter | 1M | n/a | n/a | n/a |
| Pareto Code Router | openrouter | 2M | n/a | n/a | n/a |
| Body Builder (beta) | openrouter | 128k | n/a | n/a | n/a |
| Auto Router | openrouter | 2M | n/a | n/a | n/a |
| inclusionAI: Ling-2.6-flash | inclusionai | 262k | $0.01 | <$0.0001 | <$0.01 |
| IBM: Granite 4.0 Micro | ibm-granite | 131k | $0.02 | <$0.0001 | <$0.01 |
| Meta: Llama 3.1 8B Instruct | meta-llama | 131k | $0.02 | <$0.0001 | <$0.01 |
| Mistral: Mistral Nemo | mistralai | 131k | $0.02 | <$0.0001 | <$0.01 |
| LiquidAI: LFM2-24B-A2B | liquid | 128k | $0.03 | <$0.0001 | <$0.01 |
| OpenAI: gpt-oss-20b | openai | 131k | $0.03 | <$0.0001 | <$0.01 |
| Meta: Llama 3.2 1B Instruct | meta-llama | 131k | $0.03 | <$0.0001 | <$0.01 |
| OpenAI: gpt-oss-120b | openai | 131k | $0.04 | <$0.0001 | <$0.01 |
| Cohere: Command R7B (12-2024) | cohere | 128k | $0.04 | <$0.0001 | <$0.01 |
| Amazon: Nova Micro 1.0 | amazon | 128k | $0.04 | <$0.0001 | <$0.01 |
| Qwen: Qwen2.5 7B Instruct | qwen | 131k | $0.04 | <$0.0001 | <$0.01 |
| Sao10K: Llama 3 8B Lunaris | sao10k | 8k | $0.04 | <$0.0001 | <$0.01 |
| IBM: Granite 4.1 8B | ibm-granite | 131k | $0.05 | $0.0001 | $0.01 |
| NVIDIA: Nemotron 3 Nano 30B A3B | nvidia | 262k | $0.05 | $0.0001 | $0.01 |
| Arcee AI: Trinity Mini | arcee-ai | 131k | $0.05 | $0.0001 | $0.01 |
| OpenAI: GPT-5 Nano | openai | 400k | $0.05 | $0.0001 | $0.01 |
| Qwen: Qwen3 30B A3B Instruct 2507 | qwen | 131k | $0.05 | $0.0001 | $0.01 |
| Qwen: Qwen3 8B | qwen | 131k | $0.05 | $0.0001 | $0.01 |
| Google: Gemma 3 4B | google | 131k | $0.05 | $0.0001 | $0.01 |
| Google: Gemma 3 12B | google | 131k | $0.05 | $0.0001 | $0.01 |
| Mistral: Mistral Small 3 | mistralai | 33k | $0.05 | $0.0001 | $0.01 |
| Meta: Llama 3.2 3B Instruct | meta-llama | 131k | $0.05 | $0.0001 | $0.01 |
| Google: Gemma 4 26B A4B | google | 262k | $0.06 | $0.0001 | $0.01 |
| Z.ai: GLM 4.7 Flash | z-ai | 203k | $0.06 | $0.0001 | $0.01 |
| Google: Gemma 3n 4B | google | 33k | $0.06 | $0.0001 | $0.01 |
| Amazon: Nova Lite 1.0 | amazon | 300k | $0.06 | $0.0001 | $0.01 |
| MythoMax 13B | gryphe | 4k | $0.06 | $0.0001 | $0.01 |
| Tencent: Hy3 preview | tencent | 262k | $0.07 | $0.0001 | $0.01 |
| Qwen: Qwen3.5-Flash | qwen | 1M | $0.07 | $0.0001 | $0.01 |
| Qwen: Qwen3 Coder 30B A3B Instruct | qwen | 160k | $0.07 | $0.0001 | $0.01 |
| Microsoft: Phi 4 | microsoft | 16k | $0.07 | $0.0001 | $0.01 |
| inclusionAI: Ring-2.6-1T | inclusionai | 262k | $0.08 | $0.0001 | $0.02 |
| inclusionAI: Ling-2.6-1T | inclusionai | 262k | $0.08 | $0.0001 | $0.02 |
| ByteDance Seed: Seed 1.6 Flash | bytedance-seed | 262k | $0.08 | $0.0001 | $0.02 |
| OpenAI: gpt-oss-safeguard-20b | openai | 131k | $0.08 | $0.0001 | $0.02 |
| Microsoft: Phi 4 Mini Instruct | microsoft | 131k | $0.08 | $0.0001 | $0.02 |
| Qwen: Qwen3 VL 8B Instruct | qwen | 256k | $0.08 | $0.0001 | $0.02 |
| Qwen: Qwen3 30B A3B Thinking 2507 | qwen | 131k | $0.08 | $0.0001 | $0.02 |
| Mistral: Mistral Small 3.2 24B | mistralai | 128k | $0.08 | $0.0001 | $0.02 |
| Qwen: Qwen3 32B | qwen | 131k | $0.08 | $0.0001 | $0.02 |
| Google: Gemma 3 27B | google | 131k | $0.08 | $0.0001 | $0.02 |
| DeepSeek: DeepSeek V4 Flash | deepseek | 1M | $0.09 | $0.0001 | $0.02 |
| NVIDIA: Nemotron 3 Super | nvidia | 1M | $0.09 | $0.0001 | $0.02 |
| StepFun: Step 3.5 Flash | stepfun | 262k | $0.09 | $0.0001 | $0.02 |
| Qwen: Qwen3 Next 80B A3B Instruct | qwen | 262k | $0.09 | $0.0001 | $0.02 |
| Qwen: Qwen3 235B A22B Instruct 2507 | qwen | 262k | $0.09 | $0.0001 | $0.02 |
| Reka Edge | rekaai | 16k | $0.10 | $0.0001 | $0.02 |
| Qwen: Qwen3.5-9B | qwen | 256k | $0.10 | $0.0001 | $0.02 |
| ByteDance Seed: Seed-2.0-Mini | bytedance-seed | 262k | $0.10 | $0.0001 | $0.02 |
| Mistral: Ministral 3 3B 2512 | mistralai | 131k | $0.10 | $0.0001 | $0.02 |
| Mistral: Voxtral Small 24B 2507 | mistralai | 32k | $0.10 | $0.0001 | $0.02 |
| Qwen: Qwen3 VL 32B Instruct | qwen | 262k | $0.10 | $0.0001 | $0.02 |
| Google: Gemini 2.5 Flash Lite Preview 09-2025 | google | 1M | $0.10 | $0.0001 | $0.02 |
| Qwen: Qwen3 Next 80B A3B Thinking | qwen | 262k | $0.10 | $0.0001 | $0.02 |
| Qwen: Qwen3 235B A22B Thinking 2507 | qwen | 262k | $0.10 | $0.0001 | $0.02 |
| ByteDance: UI-TARS 7B | bytedance | 128k | $0.10 | $0.0001 | $0.02 |
| Google: Gemini 2.5 Flash Lite | google | 1M | $0.10 | $0.0001 | $0.02 |
| Qwen: Qwen3 14B | qwen | 132k | $0.10 | $0.0001 | $0.02 |
| OpenAI: GPT-4.1 Nano | openai | 1M | $0.10 | $0.0001 | $0.02 |
| Meta: Llama 4 Scout | meta-llama | 10M | $0.10 | $0.0001 | $0.02 |
| Reka Flash 3 | rekaai | 66k | $0.10 | $0.0001 | $0.02 |
| Meta: Llama 3.3 70B Instruct | meta-llama | 131k | $0.10 | $0.0001 | $0.02 |
| Qwen: Qwen3 Coder Next | qwen | 262k | $0.11 | $0.0001 | $0.02 |
| Google: Gemma 4 31B | google | 262k | $0.12 | $0.0001 | $0.02 |
| Qwen: Qwen3 VL 8B Thinking | qwen | 256k | $0.12 | $0.0001 | $0.02 |
| Qwen: Qwen3 30B A3B | qwen | 131k | $0.12 | $0.0001 | $0.02 |
| Qwen: Qwen3 VL 30B A3B Thinking | qwen | 131k | $0.13 | $0.0001 | $0.03 |
| Qwen: Qwen3 VL 30B A3B Instruct | qwen | 262k | $0.13 | $0.0001 | $0.03 |
| Nous: Hermes 4 70B | nousresearch | 131k | $0.13 | $0.0001 | $0.03 |
| Z.ai: GLM 4.5 Air | z-ai | 131k | $0.13 | $0.0001 | $0.03 |
| Xiaomi: MiMo-V2.5 | xiaomi | 1M | $0.14 | $0.0001 | $0.03 |
| Qwen: Qwen3.5-35B-A3B | qwen | 262k | $0.14 | $0.0001 | $0.03 |
| Tencent: Hunyuan A13B Instruct | tencent | 131k | $0.14 | $0.0001 | $0.03 |
| Meta: Llama 3 8B Instruct | meta-llama | 8k | $0.14 | $0.0001 | $0.03 |
| Perceptron: Perceptron Mk1 | perceptron | 33k | $0.15 | $0.0001 | $0.03 |
| Qwen: Qwen3.6 35B A3B | qwen | 262k | $0.15 | $0.0001 | $0.03 |
| Mistral: Mistral Small 4 | mistralai | 262k | $0.15 | $0.0001 | $0.03 |
| MiniMax: MiniMax M2.5 | minimax | 205k | $0.15 | $0.0001 | $0.03 |
| Upstage: Solar Pro 3 | upstage | 128k | $0.15 | $0.0001 | $0.03 |
| EssentialAI: Rnj 1 Instruct | essentialai | 33k | $0.15 | $0.0001 | $0.03 |
| Mistral: Ministral 3 8B 2512 | mistralai | 262k | $0.15 | $0.0001 | $0.03 |
| AllenAI: Olmo 3 32B Think | allenai | 66k | $0.15 | $0.0001 | $0.03 |
| Meta: Llama 4 Maverick | meta-llama | 1M | $0.15 | $0.0001 | $0.03 |
| OpenAI: GPT-4o-mini Search Preview | openai | 128k | $0.15 | $0.0001 | $0.03 |
| Cohere: Command R (08-2024) | cohere | 128k | $0.15 | $0.0001 | $0.03 |
| OpenAI: GPT-4o-mini | openai | 128k | $0.15 | $0.0001 | $0.03 |
| OpenAI: GPT-4o-mini (2024-07-18) | openai | 128k | $0.15 | $0.0001 | $0.03 |
| TheDrummer: Rocinante 12B | thedrummer | 33k | $0.17 | $0.0002 | $0.03 |
| Meta: Llama Guard 4 12B | meta-llama | 164k | $0.18 | $0.0002 | $0.04 |
| Qwen: Qwen3.6 Flash | qwen | 1M | $0.19 | $0.0002 | $0.04 |
| StepFun: Step 3.7 Flash | stepfun | 256k | $0.20 | $0.0002 | $0.04 |
| OpenAI: GPT-5.4 Nano | openai | 400k | $0.20 | $0.0002 | $0.04 |
| Qwen: Qwen3.5-27B | qwen | 262k | $0.20 | $0.0002 | $0.04 |
| Mistral: Ministral 3 14B 2512 | mistralai | 262k | $0.20 | $0.0002 | $0.04 |
| Prime Intellect: INTELLECT-3 | prime-intellect | 131k | $0.20 | $0.0002 | $0.04 |
| Qwen: Qwen3 VL 235B A22B Instruct | qwen | 262k | $0.20 | $0.0002 | $0.04 |
| Qwen: Qwen3 Coder Flash | qwen | 1M | $0.20 | $0.0002 | $0.04 |
| DeepSeek: DeepSeek V3 0324 | deepseek | 164k | $0.20 | $0.0002 | $0.04 |
| Mistral: Saba | mistralai | 33k | $0.20 | $0.0002 | $0.04 |
| MiniMax: MiniMax-01 | minimax | 1M | $0.20 | $0.0002 | $0.04 |
| DeepSeek: DeepSeek V3 | deepseek | 131k | $0.20 | $0.0002 | $0.04 |
| DeepSeek: DeepSeek V3.1 | deepseek | 164k | $0.21 | $0.0002 | $0.04 |
| Arcee AI: Trinity Large Thinking | arcee-ai | 262k | $0.22 | $0.0002 | $0.04 |
| Qwen: Qwen3 Coder 480B A35B | qwen | 1M | $0.22 | $0.0002 | $0.04 |
| DeepSeek: DeepSeek V3.2 | deepseek | 131k | $0.23 | $0.0002 | $0.05 |
| Google: Gemini 3.1 Flash Lite | google | 1M | $0.25 | $0.0003 | $0.05 |
| MiniMax: MiniMax M2.7 | minimax | 205k | $0.25 | $0.0003 | $0.05 |
| ByteDance Seed: Seed-2.0-Lite | bytedance-seed | 262k | $0.25 | $0.0003 | $0.05 |
| Inception: Mercury 2 | inception | 128k | $0.25 | $0.0003 | $0.05 |
| Google: Gemini 3.1 Flash Lite Preview | google | 1M | $0.25 | $0.0003 | $0.05 |
| ByteDance Seed: Seed 1.6 | bytedance-seed | 262k | $0.25 | $0.0003 | $0.05 |
| OpenAI: GPT-5.1-Codex-Mini | openai | 400k | $0.25 | $0.0003 | $0.05 |
| OpenAI: GPT-5 Mini | openai | 400k | $0.25 | $0.0003 | $0.05 |
| Anthropic: Claude 3 Haiku | anthropic | 200k | $0.25 | $0.0003 | $0.05 |
| Qwen: Qwen3.5-122B-A10B | qwen | 262k | $0.26 | $0.0003 | $0.05 |
| Qwen: Qwen3.5 Plus 2026-02-15 | qwen | 1M | $0.26 | $0.0003 | $0.05 |
| MiniMax: MiniMax M2 | minimax | 205k | $0.26 | $0.0003 | $0.05 |
| Qwen: Qwen3 VL 235B A22B Thinking | qwen | 131k | $0.26 | $0.0003 | $0.05 |
| Qwen: Qwen Plus 0728 (thinking) | qwen | 1M | $0.26 | $0.0003 | $0.05 |
| Qwen: Qwen Plus 0728 | qwen | 1M | $0.26 | $0.0003 | $0.05 |
| Qwen: Qwen-Plus | qwen | 1M | $0.26 | $0.0003 | $0.05 |
| DeepSeek: DeepSeek V3.2 Exp | deepseek | 164k | $0.27 | $0.0003 | $0.05 |
| DeepSeek: DeepSeek V3.1 Terminus | deepseek | 164k | $0.27 | $0.0003 | $0.05 |
| Qwen: Qwen3.6 27B | qwen | 262k | $0.29 | $0.0003 | $0.06 |
| MiniMax: MiniMax M2.1 | minimax | 205k | $0.29 | $0.0003 | $0.06 |
| MiniMax: MiniMax M3 | minimax | 1M | $0.30 | $0.0003 | $0.06 |
| Qwen: Qwen3.5 Plus 2026-04-20 | qwen | 1M | $0.30 | $0.0003 | $0.06 |
| Kwaipilot: KAT-Coder-Pro V2 | kwaipilot | 262k | $0.30 | $0.0003 | $0.06 |
| MiniMax: MiniMax M2-her | minimax | 66k | $0.30 | $0.0003 | $0.06 |
| Z.ai: GLM 4.6V | z-ai | 131k | $0.30 | $0.0003 | $0.06 |
| Amazon: Nova 2 Lite | amazon | 1M | $0.30 | $0.0003 | $0.06 |
| Google: Nano Banana (Gemini 2.5 Flash Image) | google | 33k | $0.30 | $0.0003 | $0.06 |
| TheDrummer: Cydonia 24B V4.1 | thedrummer | 131k | $0.30 | $0.0003 | $0.06 |
| Mistral: Codestral 2508 | mistralai | 256k | $0.30 | $0.0003 | $0.06 |
| Google: Gemini 2.5 Flash | google | 1M | $0.30 | $0.0003 | $0.06 |
| Qwen: Qwen3.7 Plus | qwen | 1M | $0.32 | $0.0003 | $0.06 |
| Qwen: Qwen3.6 Plus | qwen | 1M | $0.33 | $0.0003 | $0.07 |
| Mistral: Mistral Small 3.1 24B | mistralai | 128k | $0.35 | $0.0003 | $0.07 |
| Meta: Llama 3.2 11B Vision Instruct | meta-llama | 131k | $0.35 | $0.0003 | $0.07 |
| Qwen2.5 72B Instruct | qwen | 131k | $0.36 | $0.0004 | $0.07 |
| MoonshotAI: Kimi K2.5 | moonshotai | 262k | $0.38 | $0.0004 | $0.08 |
| Qwen: Qwen3.5 397B A17B | qwen | 256k | $0.39 | $0.0004 | $0.08 |
| Z.ai: GLM 4.7 | z-ai | 203k | $0.40 | $0.0004 | $0.08 |
| Mistral: Devstral 2 2512 | mistralai | 262k | $0.40 | $0.0004 | $0.08 |
| NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 | nvidia | 131k | $0.40 | $0.0004 | $0.08 |
| Mistral: Mistral Medium 3.1 | mistralai | 131k | $0.40 | $0.0004 | $0.08 |
| MiniMax: MiniMax M1 | minimax | 1M | $0.40 | $0.0004 | $0.08 |
| Mistral: Mistral Medium 3 | mistralai | 131k | $0.40 | $0.0004 | $0.08 |
| OpenAI: GPT-4.1 Mini | openai | 1M | $0.40 | $0.0004 | $0.08 |
| TheDrummer: UnslopNemo 12B | thedrummer | 33k | $0.40 | $0.0004 | $0.08 |
| Meta: Llama 3.1 70B Instruct | meta-llama | 131k | $0.40 | $0.0004 | $0.08 |
| Baidu: ERNIE 4.5 VL 424B A47B | baidu | 131k | $0.42 | $0.0004 | $0.08 |
| Z.ai: GLM 4.6 | z-ai | 203k | $0.43 | $0.0004 | $0.09 |
| DeepSeek: DeepSeek V4 Pro | deepseek | 1M | $0.44 | $0.0004 | $0.09 |
| Xiaomi: MiMo-V2.5-Pro | xiaomi | 1M | $0.44 | $0.0004 | $0.09 |
| Qwen: Qwen3 235B A22B | qwen | 131k | $0.45 | $0.0004 | $0.09 |
| ReMM SLERP 13B | undi95 | 6k | $0.45 | $0.0004 | $0.09 |
| Google: Nano Banana 2 (Gemini 3.1 Flash Image) | google | 131k | $0.50 | $0.0005 | $0.10 |
| NVIDIA: Nemotron 3 Ultra | nvidia | 1M | $0.50 | $0.0005 | $0.10 |
| Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) | google | 131k | $0.50 | $0.0005 | $0.10 |
| Google: Gemini 3 Flash Preview | google | 1M | $0.50 | $0.0005 | $0.10 |
| Mistral: Mistral Large 3 2512 | mistralai | 262k | $0.50 | $0.0005 | $0.10 |
| DeepSeek: R1 0528 | deepseek | 164k | $0.50 | $0.0005 | $0.10 |
| Arcee AI: Coder Large | arcee-ai | 33k | $0.50 | $0.0005 | $0.10 |
| OpenAI: GPT-3.5 Turbo | openai | 16k | $0.50 | $0.0005 | $0.10 |
| Meta: Llama 3 70B Instruct | meta-llama | 8k | $0.51 | $0.0005 | $0.10 |
| TheDrummer: Skyfall 36B V2 | thedrummer | 33k | $0.55 | $0.0006 | $0.11 |
| MoonshotAI: Kimi K2 0711 | moonshotai | 131k | $0.57 | $0.0006 | $0.11 |
| Z.ai: GLM 5 | z-ai | 203k | $0.60 | $0.0006 | $0.12 |
| Writer: Palmyra X5 | writer | 1M | $0.60 | $0.0006 | $0.12 |
| OpenAI: GPT Audio Mini | openai | 128k | $0.60 | $0.0006 | $0.12 |
| MoonshotAI: Kimi K2 Thinking | moonshotai | 262k | $0.60 | $0.0006 | $0.12 |
| MoonshotAI: Kimi K2 0905 | moonshotai | 262k | $0.60 | $0.0006 | $0.12 |
| Z.ai: GLM 4.5V | z-ai | 66k | $0.60 | $0.0006 | $0.12 |
| Z.ai: GLM 4.5 | z-ai | 131k | $0.60 | $0.0006 | $0.12 |
| WizardLM-2 8x22B | microsoft | 66k | $0.62 | $0.0006 | $0.12 |
| Qwen: Qwen3 Coder Plus | qwen | 1M | $0.65 | $0.0006 | $0.13 |
| Sao10K: Llama 3.3 Euryale 70B | sao10k | 131k | $0.65 | $0.0006 | $0.13 |
| Google: Gemma 2 27B | google | 8k | $0.65 | $0.0006 | $0.13 |
| Qwen2.5 Coder 32B Instruct | qwen | 128k | $0.66 | $0.0007 | $0.13 |
| MoonshotAI Kimi Latest | ~moonshotai | 262k | $0.68 | $0.0007 | $0.14 |
| MoonshotAI: Kimi K2.6 | moonshotai | 262k | $0.68 | $0.0007 | $0.14 |
| AionLabs: Aion-1.0-Mini | aion-labs | 131k | $0.70 | $0.0007 | $0.14 |
| DeepSeek: R1 | deepseek | 164k | $0.70 | $0.0007 | $0.14 |
| Nous: Hermes 3 70B Instruct | nousresearch | 131k | $0.70 | $0.0007 | $0.14 |
| MoonshotAI: Kimi K2.7 Code | moonshotai | 262k | $0.74 | $0.0007 | $0.15 |
| OpenAI GPT Mini Latest | ~openai | 400k | $0.75 | $0.0008 | $0.15 |
| OpenAI: GPT-5.4 Mini | openai | 400k | $0.75 | $0.0008 | $0.15 |
| Arcee AI: Virtuoso Large | arcee-ai | 131k | $0.75 | $0.0008 | $0.15 |
| Mancer: Weaver (alpha) | mancer | 8k | $0.75 | $0.0008 | $0.15 |
| Qwen: Qwen3 Max Thinking | qwen | 262k | $0.78 | $0.0008 | $0.16 |
| Qwen: Qwen3 Max | qwen | 262k | $0.78 | $0.0008 | $0.16 |
| AionLabs: Aion-2.0 | aion-labs | 131k | $0.80 | $0.0008 | $0.16 |
| Morph: Morph V3 Fast | morph | 82k | $0.80 | $0.0008 | $0.16 |
| AionLabs: Aion-RP 1.0 (8B) | aion-labs | 33k | $0.80 | $0.0008 | $0.16 |
| Qwen: Qwen2.5 VL 72B Instruct | qwen | 131k | $0.80 | $0.0008 | $0.16 |
| DeepSeek: R1 Distill Llama 70B | deepseek | 128k | $0.80 | $0.0008 | $0.16 |
| Amazon: Nova Pro 1.0 | amazon | 300k | $0.80 | $0.0008 | $0.16 |
| Anthropic: Claude 3.5 Haiku | anthropic | 200k | $0.80 | $0.0008 | $0.16 |
| Relace: Relace Apply 3 | relace | 256k | $0.85 | $0.0008 | $0.17 |
| Switchpoint Router | switchpoint | 131k | $0.85 | $0.0008 | $0.17 |
| Sao10K: Llama 3.1 Euryale 70B v2.2 | sao10k | 131k | $0.85 | $0.0008 | $0.17 |
| Morph: Morph V3 Large | morph | 262k | $0.90 | $0.0009 | $0.18 |
| Z.ai: GLM 5.1 | z-ai | 203k | $0.98 | $0.0010 | $0.20 |
| xAI: Grok Build 0.1 | x-ai | 256k | $1.00 | $0.0010 | $0.20 |
| Anthropic Claude Haiku Latest | ~anthropic | 200k | $1.00 | $0.0010 | $0.20 |
| Relace: Relace Search | relace | 256k | $1.00 | $0.0010 | $0.20 |
| Anthropic: Claude Haiku 4.5 | anthropic | 200k | $1.00 | $0.0010 | $0.20 |
| Nous: Hermes 4 405B | nousresearch | 131k | $1.00 | $0.0010 | $0.20 |
| Perplexity: Sonar | perplexity | 127k | $1.00 | $0.0010 | $0.20 |
| Nous: Hermes 3 405B Instruct | nousresearch | 131k | $1.00 | $0.0010 | $0.20 |
| OpenAI: GPT-3.5 Turbo (older v0613) | openai | 4k | $1.00 | $0.0010 | $0.20 |
| Qwen: Qwen3.6 Max Preview | qwen | 262k | $1.04 | $0.0010 | $0.21 |
| OpenAI: o4 Mini High | openai | 200k | $1.10 | $0.0011 | $0.22 |
| OpenAI: o4 Mini | openai | 200k | $1.10 | $0.0011 | $0.22 |
| OpenAI: o3 Mini High | openai | 200k | $1.10 | $0.0011 | $0.22 |
| OpenAI: o3 Mini | openai | 200k | $1.10 | $0.0011 | $0.22 |
| Z.ai: GLM 5 Turbo | z-ai | 262k | $1.20 | $0.0012 | $0.24 |
| Qwen: Qwen3.7 Max | qwen | 1M | $1.25 | $0.0013 | $0.25 |
| xAI: Grok 4.3 | x-ai | 1M | $1.25 | $0.0013 | $0.25 |
| xAI: Grok 4.20 Multi-Agent | x-ai | 2M | $1.25 | $0.0013 | $0.25 |
| xAI: Grok 4.20 | x-ai | 2M | $1.25 | $0.0013 | $0.25 |
| OpenAI: GPT-5.1-Codex-Max | openai | 400k | $1.25 | $0.0013 | $0.25 |
| Deep Cogito: Cogito v2.1 671B | deepcogito | 128k | $1.25 | $0.0013 | $0.25 |
| OpenAI: GPT-5.1 | openai | 400k | $1.25 | $0.0013 | $0.25 |
| OpenAI: GPT-5.1 Chat | openai | 128k | $1.25 | $0.0013 | $0.25 |
| OpenAI: GPT-5.1-Codex | openai | 400k | $1.25 | $0.0013 | $0.25 |
| OpenAI: GPT-5 Codex | openai | 400k | $1.25 | $0.0013 | $0.25 |
| OpenAI: GPT-5 Chat | openai | 128k | $1.25 | $0.0013 | $0.25 |
| OpenAI: GPT-5 | openai | 400k | $1.25 | $0.0013 | $0.25 |
| Google: Gemini 2.5 Pro | google | 1M | $1.25 | $0.0013 | $0.25 |
| Google: Gemini 2.5 Pro Preview 06-05 | google | 1M | $1.25 | $0.0013 | $0.25 |
| Google: Gemini 2.5 Pro Preview 05-06 | google | 1M | $1.25 | $0.0013 | $0.25 |
| Z.ai: GLM 5.2 | z-ai | 1M | $1.40 | $0.0014 | $0.28 |
| Google: Gemini 3.5 Flash | google | 1M | $1.50 | $0.0015 | $0.30 |
| Mistral: Mistral Medium 3.5 | mistralai | 262k | $1.50 | $0.0015 | $0.30 |
| Google Gemini Flash Latest | google | 1M | $1.50 | $0.0015 | $0.30 |
| OpenAI: GPT-3.5 Turbo Instruct | openai | 4k | $1.50 | $0.0015 | $0.30 |
| OpenAI: GPT-5.3 Chat | openai | 128k | $1.75 | $0.0018 | $0.35 |
| OpenAI: GPT-5.3-Codex | openai | 400k | $1.75 | $0.0018 | $0.35 |
| OpenAI: GPT-5.2-Codex | openai | 400k | $1.75 | $0.0018 | $0.35 |
| OpenAI: GPT-5.2 Chat | openai | 128k | $1.75 | $0.0018 | $0.35 |
| OpenAI: GPT-5.2 | openai | 400k | $1.75 | $0.0018 | $0.35 |
| Google: Nano Banana Pro (Gemini 3 Pro Image) | google | 66k | $2.00 | $0.0020 | $0.40 |
| Google Gemini Pro Latest | google | 1M | $2.00 | $0.0020 | $0.40 |
| Google: Gemini 3.1 Pro Preview Custom Tools | google | 1M | $2.00 | $0.0020 | $0.40 |
| Google: Gemini 3.1 Pro Preview | google | 1M | $2.00 | $0.0020 | $0.40 |
| Google: Nano Banana Pro (Gemini 3 Pro Image Preview) | google | 66k | $2.00 | $0.0020 | $0.40 |
| OpenAI: o4 Mini Deep Research | openai | 200k | $2.00 | $0.0020 | $0.40 |
| AI21: Jamba Large 1.7 | ai21 | 256k | $2.00 | $0.0020 | $0.40 |
| OpenAI: o3 | openai | 200k | $2.00 | $0.0020 | $0.40 |
| OpenAI: GPT-4.1 | openai | 1M | $2.00 | $0.0020 | $0.40 |
| Perplexity: Sonar Reasoning Pro | perplexity | 128k | $2.00 | $0.0020 | $0.40 |
| Perplexity: Sonar Deep Research | perplexity | 128k | $2.00 | $0.0020 | $0.40 |
| Mistral Large 2407 | mistralai | 131k | $2.00 | $0.0020 | $0.40 |
| Mistral: Mixtral 8x22B Instruct | mistralai | 66k | $2.00 | $0.0020 | $0.40 |
| Mistral Large | mistralai | 128k | $2.00 | $0.0020 | $0.40 |
| OpenAI: GPT-5.4 | openai | 1M | $2.50 | $0.0025 | $0.50 |
| OpenAI: GPT Audio | openai | 128k | $2.50 | $0.0025 | $0.50 |
| Amazon: Nova Premier 1.0 | amazon | 1M | $2.50 | $0.0025 | $0.50 |
| OpenAI: GPT-5 Image Mini | openai | 400k | $2.50 | $0.0025 | $0.50 |
| Cohere: Command A | cohere | 256k | $2.50 | $0.0025 | $0.50 |
| OpenAI: GPT-4o Search Preview | openai | 128k | $2.50 | $0.0025 | $0.50 |
| OpenAI: GPT-4o (2024-11-20) | openai | 128k | $2.50 | $0.0025 | $0.50 |
| Inflection: Inflection 3 Pi | inflection | 8k | $2.50 | $0.0025 | $0.50 |
| Inflection: Inflection 3 Productivity | inflection | 8k | $2.50 | $0.0025 | $0.50 |
| Cohere: Command R+ (08-2024) | cohere | 128k | $2.50 | $0.0025 | $0.50 |
| OpenAI: GPT-4o (2024-08-06) | openai | 128k | $2.50 | $0.0025 | $0.50 |
| OpenAI: GPT-4o | openai | 128k | $2.50 | $0.0025 | $0.50 |
| Anthropic Claude Sonnet Latest | ~anthropic | 1M | $3.00 | $0.0030 | $0.60 |
| Anthropic: Claude Sonnet 4.6 | anthropic | 1M | $3.00 | $0.0030 | $0.60 |
| Perplexity: Sonar Pro Search | perplexity | 200k | $3.00 | $0.0030 | $0.60 |
| Anthropic: Claude Sonnet 4.5 | anthropic | 1M | $3.00 | $0.0030 | $0.60 |
| Anthropic: Claude Sonnet 4 | anthropic | 1M | $3.00 | $0.0030 | $0.60 |
| Perplexity: Sonar Pro | perplexity | 200k | $3.00 | $0.0030 | $0.60 |
| Sao10K: Llama 3.1 70B Hanami x1 | sao10k | 16k | $3.00 | $0.0030 | $0.60 |
| Magnum v4 72B | anthracite-org | 33k | $3.00 | $0.0030 | $0.60 |
| OpenAI: GPT-3.5 Turbo 16k | openai | 16k | $3.00 | $0.0030 | $0.60 |
| AionLabs: Aion-1.0 | aion-labs | 131k | $4.00 | $0.0040 | $0.80 |
| Anthropic: Claude Opus 4.8 | anthropic | 1M | $5.00 | $0.0050 | $1.00 |
| OpenAI: GPT Chat Latest | openai | 400k | $5.00 | $0.0050 | $1.00 |
| OpenAI GPT Latest | ~openai | 1M | $5.00 | $0.0050 | $1.00 |
| OpenAI: GPT-5.5 | openai | 1M | $5.00 | $0.0050 | $1.00 |
| Anthropic: Claude Opus Latest | ~anthropic | 1M | $5.00 | $0.0050 | $1.00 |
| Anthropic: Claude Opus 4.7 | anthropic | 1M | $5.00 | $0.0050 | $1.00 |
| Anthropic: Claude Opus 4.6 | anthropic | 1M | $5.00 | $0.0050 | $1.00 |
| Anthropic: Claude Opus 4.5 | anthropic | 200k | $5.00 | $0.0050 | $1.00 |
| OpenAI: GPT-4o (2024-05-13) | openai | 128k | $5.00 | $0.0050 | $1.00 |
| OpenAI: GPT-5.4 Image 2 | openai | 272k | $8.00 | $0.0080 | $1.60 |
| Anthropic: Claude Fable Latest | ~anthropic | 1M | $10.00 | $0.0100 | $2.00 |
| Anthropic: Claude Fable 5 | anthropic | 1M | $10.00 | $0.0100 | $2.00 |
| Anthropic: Claude Opus 4.8 (Fast) | anthropic | 1M | $10.00 | $0.0100 | $2.00 |
| OpenAI: GPT-5 Image | openai | 400k | $10.00 | $0.0100 | $2.00 |
| OpenAI: o3 Deep Research | openai | 200k | $10.00 | $0.0100 | $2.00 |
| OpenAI: GPT-4 Turbo | openai | 128k | $10.00 | $0.0100 | $2.00 |
| OpenAI: GPT-4 Turbo Preview | openai | 128k | $10.00 | $0.0100 | $2.00 |
| OpenAI: GPT-5 Pro | openai | 400k | $15.00 | $0.0150 | $3.00 |
| Anthropic: Claude Opus 4.1 | anthropic | 200k | $15.00 | $0.0150 | $3.00 |
| Anthropic: Claude Opus 4 | anthropic | 200k | $15.00 | $0.0150 | $3.00 |
| OpenAI: o1 | openai | 200k | $15.00 | $0.0150 | $3.00 |
| OpenAI: o3 Pro | openai | 200k | $20.00 | $0.0200 | $4.00 |
| OpenAI: GPT-5.2 Pro | openai | 400k | $21.00 | $0.0210 | $4.20 |
| Anthropic: Claude Opus 4.7 (Fast) | anthropic | 1M | $30.00 | $0.0300 | $6.00 |
| OpenAI: GPT-5.5 Pro | openai | 1M | $30.00 | $0.0300 | $6.00 |
| Anthropic: Claude Opus 4.6 (Fast) | anthropic | 1M | $30.00 | $0.0300 | $6.00 |
| OpenAI: GPT-5.4 Pro | openai | 1M | $30.00 | $0.0300 | $6.00 |
| OpenAI: GPT-4 | openai | 8k | $30.00 | $0.0300 | $6.00 |
| OpenAI: o1-pro | openai | 200k | $150.00 | $0.1500 | $30.00 |
All prices are for input tokens via the API. Output tokens typically cost 3–5× more. Prices change frequently, so always verify them directly with the provider.
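Because the table lists input prices only, a rough total for a workload has to add an estimate for output tokens. The following is a minimal sketch, not a provider formula: the function name `estimate_cost_usd` and the default multiplier of 4.0 are assumptions chosen from the middle of the 3–5× range mentioned above. Replace the multiplier with the provider's real output price when you know it.

```python
def estimate_cost_usd(input_tokens, output_tokens,
                      input_price_per_million, output_multiplier=4.0):
    """Rough cost estimate in USD.

    output_multiplier is an assumption (output priced at N times the
    input price); real output prices vary by model and provider.
    """
    output_price_per_million = input_price_per_million * output_multiplier
    input_cost = input_tokens / 1_000_000 * input_price_per_million
    output_cost = output_tokens / 1_000_000 * output_price_per_million
    return input_cost + output_cost


# Example: 200,000 input tokens and 50,000 output tokens
# at $0.04 per 1M input tokens (the Command R7B row in the table).
cost = estimate_cost_usd(200_000, 50_000, 0.04)
print(f"Estimated cost: ${cost:.4f}")
```

With these numbers the input and output parts each come to about $0.008, which shows how a small amount of output can match the cost of a much larger input.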
Frequently asked questions about AI API costs
What is an API and who needs it?
An API lets you use AI models directly in your own software, automated workflows and apps. You pay per token instead of a fixed subscription.
Is a cheap API as good as an expensive one?
Often, yes. For many language and everyday tasks, low-cost models can deliver very strong value. Choose based on real results, not price alone.
Which model gives the best value for money?
It depends on the task. Compare quality, price, response time, stability and data requirements together. Test on your own content before standardising.
What are tokens?
Tokens are pieces of text that the model reads and writes. Prices are often stated per 1 million tokens. At high volume and with long documents, price differences matter.
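To make the per-token arithmetic concrete, here is a small sketch that converts a per-1M price into a per-1,000 price and into a cost for a batch of documents. The figure of 2,000 tokens per document is an assumption: it is inferred from the ratios in the table's 100-documents column (for example $2.50 per 1M tokens next to $0.50), not stated by the source. The function names are illustrative.

```python
# Assumption: about 2,000 input tokens per document, inferred from the table.
TOKENS_PER_DOCUMENT = 2_000


def price_per_thousand(price_per_million):
    """Convert a USD price per 1M tokens into a price per 1,000 tokens."""
    return price_per_million / 1_000


def cost_for_documents(price_per_million, documents=100,
                       tokens_per_document=TOKENS_PER_DOCUMENT):
    """Input-token cost in USD for a batch of equally sized documents."""
    total_tokens = documents * tokens_per_document
    return total_tokens / 1_000_000 * price_per_million


for name, price in [("Cohere: Command R7B (12-2024)", 0.04),
                    ("OpenAI: GPT-4o", 2.50)]:
    print(f"{name}: ${price_per_thousand(price):.5f} per 1,000 tokens, "
          f"${cost_for_documents(price):.3f} per 100 documents")
```

The per-1,000 price is printed with five decimals because the cheapest models round to zero at four.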