AI orchestrator ranking
Different models are best at different tasks. Orchestrator signals help decide which model should handle writing, reasoning, coding or cost-sensitive work.
What we measure
We compare language quality, instruction following, speed, price and stability without exposing proprietary scoring details.
Why it matters
Models that are strong in English are not always the best choice for Nordic-language work.
Use the data
The benchmark helps you choose the right model for writing, research, agents and API usage.
Price and value
A higher subscription or API price does not automatically produce better Nordic-language output.
Current results
The table below uses the latest available benchmark data for the selected language build.
Method principle
The public view shows actionable scores and signals, while exact tasks and weighting remain proprietary.
Today’s ranking
Independent. No affiliate agreements or commercial ties to AI providers.
| # | Model | Tier | Nordic | Instruction | t/s | Score | Price/1M |
|---|---|---|---|---|---|---|---|
| 1 | Cohere: Command R+ (08-2024)cohere | Premium | 10.0 | 10.0 | — | 8.7 | — |
| 2 | Mistral Large 2407mistralai | Mid-range | 8.0 | 10.0 | — | 8.5 | — |
| 3 | Mistral: Mistral Small 3mistralai | Budget | 6.0 | 10.0 | — | 8.2 | — |
| 4 | Meta: Llama 3.2 3B Instructmeta-llama | Budget | 8.0 | 8.0 | — | 8.1 | — |
| 5 | Anthropic: Claude Haiku 4.5anthropic | Mid-range | 8.0 | 10.0 | — | 7.7 | — |
| 6 | Google: Gemma 3 4Bgoogle | Budget | 6.0 | 10.0 | — | 7.6 | — |
| 7 | Anthropic: Claude Sonnet 4.6anthropic | Premium | 8.0 | 10.0 | — | 7.5 | — |
| 8 | Cohere: Command R7B (12-2024)cohere | Budget | 8.0 | 8.0 | — | 7.4 | — |
Explore more
Over 350 AI models are evaluated every day. The strongest results are presented for Nordic language work.