Artificial Analysis Benchmark
223 models · 101 linked to model directory · Updated May 23, 2026 at 01:29 PM
| Model | Provider | Intelligence | Context | Price ($/M tok) | Speed (tok/s) | Latency (s) | Matched |
|---|---|---|---|---|---|---|---|
| GPT-5.5 (xhigh) | OpenAI | 60 | 922k | $4.35 | 72 | 117.8 | openai |
| GPT-5.5 (high) | OpenAI | 59 | 922k | $4.35 | 67 | 34.7 | openai |
| Claude Opus 4.7 (max) | Anthropic | 57 | 1M | $4.10 | 50 | 12.3 | anthropic |
| Gemini 3.1 Pro Preview | 57 | 1M | $1.74 | 130 | 29.0 | ||
| GPT-5.5 (medium) | OpenAI | 57 | 922k | $4.35 | 61 | 4.1 | openai |
| Qwen3.7 Max | Alibaba | 57 | 1M | $1.43 | 195 | 2.5 | alibaba |
| Gemini 3.5 Flash | 55 | 1M | $1.31 | 198 | 20.9 | ||
| Kimi K2.6 | Kimi | 54 | 256k | $0.70 | 47 | 2.3 | moonshotai |
| MiMo-V2.5-Pro | Xiaomi | 54 | 1M | $0.71 | 60 | 2.9 | xiaomi |
| GPT-5.3 Codex (xhigh) | OpenAI | 54 | 400k | $1.87 | 76 | 92.3 | openai |
| Grok 4.3 (high) | xAI | 53 | 1M | $0.64 | 92 | 15.3 | xai |
| Muse Spark | Meta | 52 | 262k | - | - | - | — |
| Claude Opus 4.7 (Non-reasoning, high) | Anthropic | 52 | 1M | $4.10 | 47 | 1.7 | anthropic |
| Claude Sonnet 4.6 (max) | Anthropic | 52 | 1M | $2.46 | 53 | 80.0 | — |
| DeepSeek V4 Pro (Max) | DeepSeek | 52 | 1M | $0.18 | 33 | 1.8 | deepseek |
| GLM-5.1 | Z AI | 51 | 200k | $0.90 | 59 | 1.3 | zai |
| GPT-5.5 (low) | OpenAI | 51 | 922k | $4.35 | 66 | 2.1 | openai |
| Qwen3.6 Plus | Alibaba | 50 | 1M | $0.43 | 53 | 3.1 | alibaba |
| DeepSeek V4 Pro (High) | DeepSeek | 50 | 1M | $0.18 | 33 | 1.8 | deepseek |
| GLM-5 | Z AI | 50 | 200k | $0.66 | 68 | 1.6 | zai |
| MiniMax-M2.7 | MiniMax | 50 | 205k | $0.22 | 54 | 2.1 | minimax |
| MiMo-V2.5 | Xiaomi | 49 | 1M | $0.39 | 97 | 3.7 | — |
| GPT-5.4 mini (xhigh) | OpenAI | 49 | 400k | $0.65 | 160 | 7.4 | openai |
| Grok 4.3 (medium) | xAI | 49 | 1M | $0.64 | 86 | 16.9 | xai |
| GPT-5.4 (low) | OpenAI | 48 | 1.05M | $2.17 | 73 | 1.9 | openai |
| GLM-5-Turbo | Z AI | 47 | 200k | - | - | - | zai |
| DeepSeek V4 Flash (Max) | DeepSeek | 47 | 1M | $0.06 | 109 | 1.3 | deepseek |
| DeepSeek V4 Flash (High) | DeepSeek | 46 | 1M | $0.08 | - | - | deepseek |
| Qwen3.6 27B | Alibaba | 46 | 262k | $0.90 | 64 | 3.8 | alibaba |
| Qwen3.5 397B A17B | Alibaba | 45 | 262k | $0.90 | 52 | 2.6 | alibaba |
| MiMo-V2-Omni-0327 | Xiaomi | 45 | 256k | $0.34 | 104 | 2.5 | — |
| Claude Sonnet 4.6 (Non-reasoning) | Anthropic | 44 | 1M | $2.46 | 48 | 1.3 | anthropic |
| GPT-5.4 nano (xhigh) | OpenAI | 44 | 400k | $0.18 | 152 | 5.0 | openai |
| Grok 4.3 (low) | xAI | 44 | 1M | $0.64 | 78 | 8.8 | xai |
| GLM-5.1 | Z AI | 44 | 200k | $0.90 | 47 | 1.8 | zai |
| Qwen3.6 35B A3B | Alibaba | 43 | 262k | $0.37 | 191 | 2.5 | alibaba |
| MiMo-V2-Omni | Xiaomi | 43 | 256k | $0.00 | 108 | 2.4 | xiaomi |
| Gemini 3.5 Flash (minimal) | 43 | 1M | $1.31 | 204 | 0.9 | — | |
| Kimi K2.6 | Kimi | 43 | 256k | $0.70 | 69 | 2.2 | moonshotai |
| GLM 5V Turbo | Z AI | 43 | 200k | - | - | - | zai |
| Claude Sonnet 4.6 (Non-reasoning, Low Effort) | Anthropic | 43 | 1M | $2.46 | 47 | 1.4 | — |
| Hy3-preview | Tencent | 42 | 256k | $0.14 | 96 | 3.9 | — |
| Qwen3.5 122B A10B | Alibaba | 42 | 262k | $0.68 | 130 | 2.5 | alibaba |
| MiMo-V2-Flash (Feb 2026) | Xiaomi | 41 | 256k | $0.06 | 133 | 2.5 | — |
| Gemini 3 Pro Preview (low) | 41 | 1M | $1.74 | - | - | opencode | |
| GPT-5.5 (Non-reasoning) | OpenAI | 41 | 922k | $4.35 | 61 | 1.0 | openai |
| GLM-5 | Z AI | 41 | 200k | $0.66 | 53 | 2.2 | zai |
| Qwen3.5 397B A17B | Alibaba | 40 | 262k | $0.90 | 53 | 2.8 | alibaba |
| DeepSeek V4 Pro | DeepSeek | 39 | 1M | $0.18 | 32 | 1.9 | deepseek |
| Mistral Medium 3.5 | Mistral | 39 | 256k | $2.10 | 132 | 1.7 | — |
| Gemma 4 31B | 39 | 256k | $0.00 | 35 | 1.6 | nano-gpt | |
| Qwen3.5 Omni Plus | Alibaba | 39 | 256k | $0.84 | 53 | 2.4 | nano-gpt |
| Grok 4.1 Fast | xAI | 39 | 2M | - | - | - | frogbot |
| Step 3.5 Flash 2603 | StepFun | 38 | 256k | $0.00 | 190 | 1.1 | stepfun |
| Ring-2.6-1T | InclusionAI | 38 | 262k | - | - | - | — |
| o3 | OpenAI | 38 | 200k | $1.55 | 99 | 10.0 | openai |
| GPT-5.4 nano | OpenAI | 38 | 400k | $0.18 | 143 | 4.8 | openai |
| GPT-5.4 mini (medium) | OpenAI | 38 | 400k | $0.65 | 161 | 8.0 | openai |
| Kimi K2.5 | Kimi | 37 | 256k | $0.49 | 35 | 3.0 | moonshotai |
| Command A+ | Cohere | 37 | 192k | $0.00 | 205 | 0.3 | — |
| Qwen3.6 27B | Alibaba | 37 | 262k | $0.90 | 66 | 3.8 | alibaba |
| Claude 4.5 Haiku | Anthropic | 37 | 200k | $0.82 | 95 | 15.6 | — |
| DeepSeek V4 Flash | DeepSeek | 36 | 1M | $0.06 | 132 | 1.2 | deepseek |
| JT-35B-Flash | China Mobile | 36 | 256k | - | - | - | — |
| NVIDIA Nemotron 3 Super | NVIDIA | 36 | 1M | $0.28 | 173 | 1.2 | nvidia |
| Qwen3.5 122B A10B | Alibaba | 36 | 262k | $0.68 | 122 | 2.5 | alibaba |
| Nova 2.0 Pro Preview (medium) | Amazon | 36 | 256k | $1.47 | 132 | 13.1 | — |
| MiMo-V2.5-Pro | Xiaomi | 36 | 1M | $0.71 | 64 | 2.9 | xiaomi |
| GPT-5.4 (Non-reasoning) | OpenAI | 35 | 1.05M | $2.17 | 73 | 0.8 | openai |
| Gemini 3 Flash | 35 | 1M | $0.43 | 171 | 0.9 | opencode | |
| Gemini 2.5 Pro | 35 | 1M | $1.34 | 137 | 24.3 | ||
| Nova 2.0 Lite (high) | Amazon | 35 | 1M | $0.52 | 150 | 18.9 | — |
| Hy3-preview | Tencent | 34 | 256k | $0.14 | 90 | 4.1 | — |
| Ling-2.6-1T | InclusionAI | 34 | 262k | $0.52 | - | - | — |
| Doubao Seed Code | ByteDance Seed | 34 | 256k | - | - | - | — |
| Gemini 3.1 Flash-Lite Preview | 34 | 1M | $0.22 | 334 | 5.2 | ||
| gpt-oss-120b (high) | OpenAI | 33 | 131k | $0.20 | 291 | 0.9 | dinference |
| Mercury 2 | Inception | 33 | 128k | $0.14 | 855 | 4.9 | inception |
| Qwen3.5 9B | Alibaba | 32 | 262k | $0.11 | 49 | 0.6 | venice |
| Gemma 4 31B | 32 | 256k | $0.17 | 21 | 2.1 | nano-gpt | |
| K-EXAONE | LG AI Research | 32 | 256k | - | - | - | — |
| Grok 3 mini Reasoning (high) | xAI | 32 | 1M | $0.16 | 56 | 0.7 | — |
| Nova 2.0 Pro Preview (low) | Amazon | 32 | 256k | $2.13 | 139 | 7.4 | — |
| Trinity Large Thinking | Arcee AI | 32 | 512k | $0.24 | 138 | 1.1 | — |
| Qwen3.6 35B A3B | Alibaba | 32 | 262k | $0.56 | 176 | 2.5 | alibaba |
| Gemma 4 26B A4B | 31 | 256k | $0.14 | - | - | — | |
| Claude 4.5 Haiku | Anthropic | 31 | 200k | $0.82 | 99 | 0.9 | helicone |
| Grok 4.3 | xAI | 31 | 1M | $0.64 | 84 | 0.7 | xai |
| Qwen3.5 35B A3B | Alibaba | 31 | 262k | $0.42 | 173 | 2.2 | alibaba |
| MiMo-V2-Flash | Xiaomi | 30 | 256k | $0.12 | 130 | 2.4 | xiaomi |
| EXAONE 4.5 33B | LG AI Research | 30 | 262k | - | - | - | — |
| Nova 2.0 Lite (medium) | Amazon | 30 | 1M | $0.52 | 143 | 19.3 | — |
| ERNIE 5.0 Thinking Preview | Baidu | 29 | 128k | - | - | - | nano-gpt |
| Grok 4.20 0309 v2 | xAI | 29 | 2M | $1.14 | 106 | 0.7 | venice |
| Grok Code Fast 1 | xAI | 29 | 256k | - | - | - | frogbot |
| Nemotron Cascade 2 30B A3B | NVIDIA | 28 | 1M | - | - | - | — |
| Qwen3 Coder Next | Alibaba | 28 | 256k | $0.43 | 113 | 1.6 | llmgateway |
| Nova 2.0 Omni (medium) | Amazon | 28 | 1M | $0.52 | - | - | — |
| Mistral Small 4 | Mistral | 28 | 256k | $0.20 | 149 | 0.7 | — |
| Qwen3.5 9B | Alibaba | 27 | 262k | - | - | - | venice |
| Magistral Medium 1.2 | Mistral | 27 | 128k | $2.30 | 40 | 1.7 | — |
| Gemma 4 26B A4B | 27 | 256k | $0.16 | 81 | 1.3 | — | |
| Qwen3.5 4B | Alibaba | 27 | 262k | $0.04 | 169 | 0.5 | — |
| DeepSeek R1 0528 | DeepSeek | 27 | 128k | $1.64 | - | - | nano-gpt |
| Qwen3 Next 80B A3B | Alibaba | 27 | 262k | $1.05 | 150 | 2.2 | — |
| Ling 2.6 Flash | InclusionAI | 26 | 262k | $0.06 | - | - | — |
| Solar Pro 3 | Upstage | 26 | 128k | - | - | - | upstage |
| Qwen3.5 Omni Flash | Alibaba | 26 | 256k | $0.17 | 264 | 2.0 | nano-gpt |
| JT-MINI | China Mobile | 25 | 128k | - | - | - | — |
| Nova 2.0 Lite (low) | Amazon | 25 | 1M | $0.52 | 164 | 11.9 | — |
| gpt-oss-20B (high) | OpenAI | 24 | 131k | $0.07 | 243 | 0.8 | frogbot |
| gpt-oss-120b (low) | OpenAI | 24 | 131k | $0.20 | 333 | 0.8 | dinference |
| GPT-5.4 nano | OpenAI | 24 | 400k | $0.18 | 152 | 0.7 | openai |
| NVIDIA Nemotron 3 Nano | NVIDIA | 24 | 1M | $0.07 | 150 | 1.4 | — |
| LongCat Flash Lite | LongCat | 24 | 256k | $0.00 | 110 | 8.4 | — |
| Grok 4.1 Fast | xAI | 24 | 2M | - | - | - | — |
| K-EXAONE | LG AI Research | 23 | 256k | - | - | - | — |
| GPT-5.4 mini | OpenAI | 23 | 400k | $0.65 | 163 | 0.7 | openai |
| Nova 2.0 Omni (low) | Amazon | 23 | 1M | $0.52 | - | - | — |
| Nova 2.0 Pro Preview | Amazon | 23 | 256k | $2.13 | 117 | 1.2 | — |
| Mi:dm K 2.5 Pro | Korea Telecom | 23 | 128k | - | - | - | — |
| Mistral Large 3 | Mistral | 23 | 256k | $0.60 | 51 | 1.1 | — |
| Ring-1T | InclusionAI | 23 | 128k | - | - | - | bailing |
| Qwen3.5 4B | Alibaba | 23 | 262k | $0.04 | 166 | 0.5 | — |
| INTELLECT-3 | Prime Intellect | 22 | 131k | - | - | - | cortecs |
| Devstral 2 | Mistral | 22 | 256k | $0.00 | 55 | 1.3 | — |
| Solar Open 100B | Upstage | 22 | 128k | - | - | - | — |
| Gemini 2.5 Flash-Lite (Sep) | 22 | 1M | $0.07 | - | - | — | |
| Nemotron 3 Nano Omni 30B A3B Reasoning | NVIDIA | 21 | 256k | $0.10 | 306 | 1.1 | — |
| gpt-oss-20B (low) | OpenAI | 21 | 131k | $0.07 | 237 | 0.8 | frogbot |
| Qwen3 Next 80B A3B | Alibaba | 20 | 262k | $0.65 | 151 | 2.4 | alibaba |
| Devstral Small 2 | Mistral | 19 | 256k | $0.00 | 55 | 1.1 | — |
| Gemini 2.5 Flash-Lite (Sep) | 19 | 1M | $0.07 | - | - | nano-gpt | |
| Motif-2-12.7B | Motif Technologies | 19 | 128k | - | - | - | — |
| Ling-1T | InclusionAI | 19 | 128k | - | - | - | bailing |
| Nova Premier | Amazon | 19 | 1M | $2.18 | 34 | 2.9 | — |
| Gemma 4 E4B | 19 | 128k | - | - | - | — | |
| Llama Nemotron Super 49B v1.5 | NVIDIA | 19 | 128k | $0.13 | 49 | 1.4 | — |
| Mistral Small 4 | Mistral | 19 | 256k | $0.20 | 143 | 0.9 | — |
| Llama 3.3 Nemotron Super 49B | NVIDIA | 18 | 128k | - | - | - | — |
| Llama 4 Maverick | Meta | 18 | 1M | $0.34 | 111 | 1.1 | digitalocean |
| Magistral Small 1.2 | Mistral | 18 | 128k | $0.60 | 106 | 0.8 | — |
| Sarvam 105B (high) | Sarvam | 18 | 128k | $0.00 | 95 | 2.2 | sarvam |
| Nova 2.0 Lite | Amazon | 18 | 1M | $0.52 | 164 | 1.3 | — |
| Llama 3.1 405B | Meta | 17 | 128k | $3.13 | 54 | 2.4 | — |
| EXAONE 4.0 32B | LG AI Research | 17 | 131k | - | - | - | — |
| Nova 2.0 Omni | Amazon | 17 | 1M | $0.52 | - | - | — |
| Qwen3.5 2B | Alibaba | 16 | 262k | $0.03 | - | - | — |
| Nanbeige4.1-3B | Nanbeige | 16 | 256k | - | - | - | — |
| Ministral 3 14B | Mistral | 16 | 256k | $0.20 | 72 | 0.8 | ollama-cloud |
| DeepSeek R1 Distill Llama 70B | DeepSeek | 16 | 128k | $0.73 | 43 | 1.7 | alibaba-cn |
| Falcon-H1R-7B | TII UAE | 16 | 256k | - | - | - | — |
| Ling-flash-2.0 | InclusionAI | 16 | 128k | $0.18 | 82 | 2.5 | — |
| Qwen3 Omni 30B A3B | Alibaba | 16 | 65.5k | $0.32 | 88 | 2.1 | — |
| Step3 VL 10B | StepFun | 15 | 65.5k | - | - | - | — |
| Gemma 4 E2B | 15 | 128k | - | - | - | — | |
| Llama Nemotron Ultra | NVIDIA | 15 | 128k | $0.72 | 45 | 2.5 | — |
| ERNIE 4.5 300B A47B | Baidu | 15 | 131k | $0.36 | 24 | 3.5 | — |
| Solar Pro 2 | Upstage | 15 | 65.5k | - | - | - | — |
| NVIDIA Nemotron Nano 12B v2 VL | NVIDIA | 15 | 128k | $0.24 | - | - | — |
| Ministral 3 8B | Mistral | 15 | 256k | $0.15 | 95 | 0.7 | ollama-cloud |
| Gemma 4 E4B | 15 | 128k | - | - | - | — | |
| NVIDIA Nemotron Nano 9B V2 | NVIDIA | 15 | 131k | $0.05 | 117 | 0.7 | — |
| Granite 4.1 30B | IBM | 15 | 131k | - | - | - | — |
| NVIDIA Nemotron 3 Nano 4B | NVIDIA | 15 | 262k | - | - | - | — |
| Qwen3.5 2B | Alibaba | 15 | 262k | $0.03 | 325 | 0.4 | — |
| Llama Nemotron Super 49B v1.5 | NVIDIA | 15 | 128k | $0.13 | 48 | 1.3 | — |
| Llama 3.3 70B | Meta | 14 | 128k | $0.60 | 77 | 1.8 | — |
| Llama 3.1 Nemotron Nano 4B v1.1 | NVIDIA | 14 | 128k | - | - | - | — |
| Kimi Linear 48B A3B Instruct | Kimi | 14 | 1M | - | - | - | — |
| Llama 3.3 Nemotron Super 49B | NVIDIA | 14 | 128k | - | - | - | — |
| Ring-flash-2.0 | InclusionAI | 14 | 128k | $0.18 | - | - | — |
| Solar Pro 2 | Upstage | 14 | 65.5k | - | - | - | upstage |
| Llama 4 Scout | Meta | 14 | 10M | $0.22 | 114 | 0.9 | llmgateway |
| Command A | Cohere | 13 | 256k | $3.25 | 48 | 2.2 | — |
| Llama 3.1 Nemotron 70B | NVIDIA | 13 | 128k | $1.20 | 287 | 0.5 | — |
| NVIDIA Nemotron 3 Nano | NVIDIA | 13 | 1M | $0.07 | 74 | 0.5 | venice |
| NVIDIA Nemotron Nano 9B V2 | NVIDIA | 13 | 131k | $0.06 | 144 | 1.1 | openrouter |
| MiniCPM-V 4.6 1.3B | OpenBMB | 13 | 262k | - | - | - | — |
| Granite 4.1 8B | IBM | 12 | 131k | $0.06 | 125 | 0.8 | — |
| Sarvam 30B (high) | Sarvam | 12 | 65.5k | $0.00 | 252 | 2.0 | sarvam |
| Gemma 4 E2B | 12 | 128k | - | - | - | — | |
| R1 1776 | Perplexity | 12 | 128k | - | - | - | — |
| Llama 3.2 90B (Vision) | Meta | 12 | 128k | $1.38 | 58 | 1.3 | — |
| EXAONE 4.0 32B | LG AI Research | 12 | 131k | - | - | - | — |
| Ministral 3 3B | Mistral | 11 | 256k | $0.10 | 172 | 0.5 | ollama-cloud |
| Jamba 1.7 Large | AI21 Labs | 11 | 256k | $2.60 | 58 | 1.6 | — |
| Granite 4.0 H Small | IBM | 11 | 128k | $0.08 | 465 | 10.2 | — |
| Qwen3 Omni 30B A3B | Alibaba | 11 | 65.5k | $0.32 | 95 | 1.9 | — |
| Qwen3.5 0.8B | Alibaba | 11 | 262k | $0.01 | - | - | — |
| LFM2 24B A2B | Liquid AI | 10 | 32.8k | $0.04 | 245 | 0.5 | — |
| Phi-4 | Microsoft | 10 | 16k | $0.16 | 15 | 2.5 | azure-cognitive-services |
| Nova Micro | Amazon | 10 | 130k | $0.03 | 265 | 1.0 | — |
| NVIDIA Nemotron Nano 12B v2 VL | NVIDIA | 10 | 128k | $0.24 | 236 | 1.0 | vercel |
| Phi-4 Multimodal | Microsoft | 10 | 128k | $0.00 | 16 | 0.8 | azure-cognitive-services |
| Qwen3.5 0.8B | Alibaba | 10 | 262k | $0.01 | 86 | 0.4 | — |
| Jamba Reasoning 3B | AI21 Labs | 10 | 262k | - | - | - | — |
| Reka Flash 3 | Reka AI | 10 | 128k | $0.26 | - | - | — |
| Ling-mini-2.0 | InclusionAI | 9 | 131k | - | - | - | — |
| Llama 3.2 11B (Vision) | Meta | 9 | 128k | $0.25 | 48 | 0.7 | — |
| Granite 4.1 3B | IBM | 9 | 131k | - | - | - | — |
| Phi-4 Mini | Microsoft | 8 | 128k | $0.00 | - | - | azure-cognitive-services |
| Exaone 4.0 1.2B | LG AI Research | 8 | 64k | - | - | - | — |
| Exaone 4.0 1.2B | LG AI Research | 8 | 64k | - | - | - | — |
| LFM2.5-1.2B-Thinking | Liquid AI | 8 | 32k | - | - | - | — |
| Jamba 1.7 Mini | AI21 Labs | 8 | 258k | - | - | - | — |
| LFM2 2.6B | Liquid AI | 8 | 32.8k | $0.00 | - | - | — |
| LFM2.5-1.2B-Instruct | Liquid AI | 8 | 32k | $0.00 | - | - | — |
| Granite 4.0 H 1B | IBM | 8 | 128k | - | - | - | — |
| Gemma 3 270M | 8 | 32k | - | - | - | — | |
| Apertus 70B Instruct | Swiss AI Initiative | 8 | 65.5k | $1.03 | - | - | — |
| Granite 4.0 Micro | IBM | 8 | 128k | - | - | - | — |
| Granite 4.0 1B | IBM | 7 | 128k | - | - | - | — |
| LFM2 8B A1B | Liquid AI | 7 | 32.8k | $0.00 | - | - | — |
| LFM2.5-VL-1.6B | Liquid AI | 6 | 32k | $0.00 | - | - | — |
| Granite 4.0 350M | IBM | 6 | 32.8k | - | - | - | — |
| Apertus 8B Instruct | Swiss AI Initiative | 6 | 65.5k | $0.11 | - | - | — |
| Granite 4.0 H 350M | IBM | 5 | 32.8k | - | - | - | — |
| Tiny Aya Global | Cohere | 5 | 8.19k | $0.00 | - | - | — |
| EXAONE 4.5 33B | LG AI Research | - | 262k | - | - | - | — |
| Gemini 3 Deep Think | - | 128k | - | - | - | — | |
| Mi:dm K 2.5 Pro Preview | Korea Telecom | - | 128k | - | - | - | — |
| GPT-5.5 Pro (xhigh) | OpenAI | - | 922k | - | - | - | openai |