Arena WebDev
80 models · Updated May 23, 2026 at 01:52 PM
| # ▲ | Model | Provider | Score | +/− | Votes | Price $/M | Context |
|---|---|---|---|---|---|---|---|
| 1 | claude-opus-4-7-thinking | Anthropic | 1567 | +10/-10 | 4,462 | $5 / $25 | 1M |
| 2 | claude-opus-4-7 | Anthropic | 1560 | +10/-10 | 4,186 | $5 / $25 | 1M |
| 3 | claude-opus-4-6-thinking | Anthropic | 1545 | +8/-8 | 7,241 | $5 / $25 | 1M |
| 4 | claude-opus-4-6 | Anthropic | 1540 | +8/-8 | 8,242 | $5 / $25 | 1M |
| 5 | glm-5.1 | - | 1532 | +11/-11 | 3,611 | $1.40 / $4.40 | 202.8K |
| 6 | claude-sonnet-4-6 | Anthropic | 1524 | +7/-7 | 10,408 | $3 / $15 | 1M |
| 7 | kimi-k2.6 | - | 1519 | +11/-11 | 3,437 | $0.95 / $4 | 262.1K |
| 8 | muse-sparkPreliminary | Meta | 1509 | +16/-16 | 1,630 | N/A | N/A |
| 9 | gemini-3.5-flashPreliminary | - | 1507 | +14/-14 | 2,148 | $1.50 / $9 | 1M |
| 10 | gpt-5.5-xhigh (codex-harness) | - | 1503 | +11/-11 | 3,456 | N/A | N/A |
| 11 | qwen3.6-max-preview | - | 1491 | +14/-14 | 2,117 | $1.04 / $6.24 | 262.1K |
| 12 | claude-opus-4-5-20251101-thinking-32k | Anthropic | 1490 | +7/-7 | 13,067 | $5 / $25 | 200K |
| 13 | gpt-5.5-high (codex-harness) | - | 1480 | +11/-11 | 3,664 | N/A | N/A |
| 14 | mimo-v2.5-pro | - | 1471 | +10/-10 | 4,087 | $1 / $3 | 1M |
| 15 | claude-opus-4-5-20251101 | Anthropic | 1467 | +6/-6 | 15,307 | $5 / $25 | 200K |
| 16 | qwen3.6-plus | - | 1461 | +9/-9 | 5,421 | $0.33 / $1.95 | 1M |
| 17 | deepseek-v4-pro-thinking | - | 1459 | +11/-11 | 3,320 | $0.43 / $0.87 | 1M |
| 18 | gpt-5.4-high (codex-harness) | - | 1457 | +17/-17 | 1,482 | $2.50 / $15 | 1.1M |
| 19 | gemini-3.1-pro-preview | - | 1450 | +7/-7 | 9,555 | $2 / $12 | 1M |
| 20 | gpt-5.5 (codex-harness) | - | 1440 | +11/-11 | 3,436 | N/A | N/A |
| 21 | glm-4.7 | - | 1440 | +10/-10 | 4,885 | $0.40 / $1.75 | 202.8K |
| 22 | gemini-3-pro | - | 1438 | +7/-7 | 17,161 | $2 / $12 | 1M |
| 23 | mimo-v2.5 | - | 1438 | +11/-11 | 3,056 | $0.40 / $2 | 1M |
| 24 | gpt-5.4-medium (codex-harness) | - | 1437 | +16/-16 | 1,448 | $2.50 / $15 | 1.1M |
| 25 | gemini-3-flash | - | 1437 | +7/-7 | 13,276 | $0.50 / $3 | 1M |
| 26 | glm-5 | - | 1435 | +8/-8 | 6,573 | $1 / $3.20 | 202.8K |
| 27 | mimo-v2-pro | - | 1432 | +8/-8 | 6,340 | $1 / $3 | 1M |
| 28 | kimi-k2.5-thinking | - | 1430 | +7/-7 | 10,116 | $0.60 / $3 | N/A |
| 29 | kimi-k2.5-instant | - | 1408 | +11/-11 | 3,609 | $0.40 / $1.90 | 262.1K |
| 30 | gpt-5.3-codex (codex-harness) | - | 1406 | +12/-12 | 2,962 | $1.75 / $14 | 400K |
| 31 | minimax-m2.7 | - | 1405 | +9/-9 | 5,723 | $0.28 / $1.20 | 204.8K |
| 32 | gpt-5.2 | - | 1404 | +17/-17 | 1,456 | $1.75 / $14 | 400K |
| 33 | gpt-5.4-mini-high | - | 1402 | +9/-9 | 4,862 | $0.75 / $4.50 | 400K |
| 34 | grok-4.20-beta-0309-reasoning | - | 1396 | +8/-8 | 6,584 | $2 / $6 | 2M |
| 35 | gpt-5-medium | - | 1394 | +13/-13 | 3,753 | $1.25 / $10 | 400K |
| 36 | minimax-m2.1-preview | - | 1392 | +8/-8 | 9,279 | $0.29 / $0.95 | 204.8K |
| 37 | gpt-5.1-medium | - | 1391 | +9/-9 | 6,117 | $1.25 / $10 | 400K |
| 38 | qwen3.5-397b-a17b | - | 1389 | +7/-7 | 9,045 | $0.39 / $2.34 | 262.1K |
| 39 | gemini-3-flash (thinking-minimal) | - | 1389 | +6/-6 | 15,745 | $0.50 / $3 | 1M |
| 40 | claude-sonnet-4-5-20250929-thinking-32k | Anthropic | 1388 | +7/-7 | 15,738 | $3 / $15 | 200K |
| 41 | gpt-5.4 | - | 1386 | +41/-41 | 210 | $2.50 / $15 | 1.1M |
| 42 | claude-sonnet-4-5-20250929 | Anthropic | 1386 | +6/-6 | 18,403 | $3 / $15 | 200K |
| 43 | claude-opus-4-1-20250805 | Anthropic | 1385 | +9/-9 | 8,563 | $15 / $75 | 200K |
| 44 | grok-4.3 | - | 1385 | +12/-12 | 2,759 | $1.25 / $2.50 | 1M |
| 45 | minimax-m2.5 | - | 1382 | +8/-8 | 7,862 | $0.15 / $1.15 | 204.8K |
| 46 | gemma-4-31b | - | 1380 | +11/-11 | 2,969 | $0.14 / $0.40 | 262.1K |
| 47 | gpt-5.3-codex (codex-harness) | - | 1373 | +11/-11 | 3,559 | $1.75 / $14 | 400K |
| 48 | deepseek-v3.2-thinking | - | 1368 | +8/-8 | 7,918 | $0.25 / $0.38 | 131.1K |
| 49 | hunyuan-hy3-preview | Tencent | 1365 | +17/-17 | 1,350 | N/A | N/A |
| 50 | qwen3.5-122b-a10b | - | 1365 | +7/-7 | 7,737 | $0.26 / $2.08 | 262.1K |
| 51 | gemma-4-26b-a4b | - | 1361 | +16/-16 | 1,506 | N/A | N/A |
| 52 | qwen3.5-27b | - | 1356 | +8/-8 | 7,284 | $0.20 / $1.56 | 262.1K |
| 53 | glm-4.6 | - | 1355 | +9/-9 | 8,352 | $0.43 / $1.74 | 202.8K |
| 54 | gpt-5.1 | - | 1340 | +7/-7 | 12,865 | $1.25 / $10 | 400K |
| 55 | mimo-v2-flash (non-thinking) | - | 1337 | +8/-8 | 6,739 | $0.10 / $0.30 | 262.1K |
| 56 | gpt-5.2-codex | - | 1334 | +8/-8 | 7,767 | $1.75 / $14 | 400K |
| 57 | deepseek-v3.2 | - | 1332 | +7/-7 | 10,473 | $0.25 / $0.38 | 131.1K |
| 58 | kimi-k2-thinking-turbo | - | 1330 | +6/-6 | 15,362 | $1.15 / $8 | 262.1K |
| 59 | gpt-5.1-codex | - | 1329 | +10/-10 | 6,218 | $1.25 / $10 | 400K |
| 60 | claude-haiku-4-5-20251001 | Anthropic | 1321 | +6/-6 | 19,976 | $1 / $5 | 200K |
| 61 | minimax-m2 | - | 1305 | +9/-9 | 8,398 | $0.26 / $1 | 204.8K |
| 62 | mimo-v2-flash (thinking) | - | 1300 | +14/-14 | 2,098 | $0.10 / $0.30 | 262.1K |
| 63 | deepseek-v3.2-exp | - | 1287 | +11/-11 | 4,869 | $0.27 / $0.41 | 163.8K |
| 64 | qwen3-coder-480b-a35b-instruct | - | 1281 | +7/-7 | 15,218 | $0.40 / $1.60 | 262.1K |
| 65 | -Coder-Pro-V1 | KwaiKAT | 1258 | +15/-15 | 1,881 | $0.21 / $0.83 | 256K |
| 66 | qwen3.5-35b-a3b | - | 1249 | +16/-16 | 1,813 | $0.14 / $1 | 262.1K |
| 67 | trinity-large-thinkingArcee AI | - | 1246 | +19/-19 | 1,312 | $0.22 / $0.85 | 262.1K |
| 68 | gemini-3.1-flash-lite-preview | - | 1245 | +8/-8 | 8,671 | $0.25 / $1.50 | 1M |
| 69 | gpt-5.1-codex-mini | - | 1239 | +17/-17 | 1,442 | $0.25 / $2 | 400K |
| 70 | qwen3.5-flash | - | 1237 | +17/-17 | 1,561 | N/A | N/A |
| 71 | grok-4-1-fast-reasoning | - | 1234 | +9/-9 | 6,910 | $0.20 / $0.50 | 2M |
| 72 | mistral-large-3 | - | 1223 | +20/-20 | 1,032 | $0.50 / $1.50 | N/A |
| 73 | grok-4.1-thinking | - | 1208 | +20/-20 | 1,209 | N/A | N/A |
| 74 | gemini-2.5-pro | - | 1203 | +13/-13 | 3,297 | $1.25 / $10 | 1M |
| 75 | granite-4.1-8b | - | 1201 | +19/-19 | 1,419 | $0.05 / $0.10 | 131.1K |
| 76 | devstral-2 | - | 1199 | +17/-17 | 1,583 | N/A | N/A |
| 77 | mercury-2Inception AI | - | 1165 | +23/-23 | 947 | $0.25 / $0.75 | 128K |
| 78 | grok-4-fast-reasoning | - | 1149 | +23/-23 | 933 | $0.20 / $0.50 | 2M |
| 79 | grok-code-fast-1 | - | 1140 | +22/-22 | 982 | $0.20 / $1.50 | N/A |
| 80 | devstral-medium-2507 | - | 1092 | +23/-23 | 993 | $0.40 / $2 | 128K |