claw Eval
22 models evaluated
|
| 1 | Claude Opus 4.6 | Anthropic | Closed | opus46 | |
| 2 | Claude Sonnet 4.6 | Anthropic | Closed | sonnet46 | |
| 3 | MiMo-V2.5-Pro | Xiaomi | Closed | mimo_v25_pro | |
| 4 | Muse Spark | muse_spark | Meta | Closed | |
| 5 | Kimi K2.6 | kimi_k26 | Moonshot AI | Open | |
| 6 | MiMo-V2.5 | mimo_v25 | Xiaomi | Closed | |
| 7 | GLM-5.1 | glm51 | Z.AI | Open | |
| 8 | GPT-5.4 | gpt54 | OpenAI | Closed | |
| 9 | DeepSeek V4 Pro | deepseek_v4_pro | DeepSeek | Open | |
| 10 | Qwen3.6 Plus | qwen3.6_plus | Alibaba | Closed | |
| 11 | Gemini 3.1 Pro | gemini31_pro | Google | Closed | |
| 12 | DeepSeek V4 Flash | deepseek_v4_flash | DeepSeek | Open | |
| 13 | MiMo-V2-Pro | mimo_v2_pro | Xiaomi | Closed | |
| 14 | Qwen3.5 397B | qwen3.5-397b-a17b | Alibaba | Open | |
| 15 | GLM-5-Turbo | glm5_turbo | Z.AI | Closed | |
| 16 | GLM-5V-Turbo | glm5v_turbo | Z.AI | Closed | |
| 17 | Kimi K2.5 | kimi_k25 | Moonshot AI | Open | |
| 19 | Gemini 3 Flash | gemini3_flash | Google | Closed | |
| 20 | MiniMax M2.7 | minimax_m27 | MiniMax | Open | |
| 21 | MiMo-V2-Omni | mimo_v2_omni | Xiaomi | Closed | |
| 22 | DeepSeek V3.2 | deepseek_v32 | DeepSeek | Open | |
| 23 | Nemotron 3 Super 100B | nemotron3_super | NVIDIA | Open | |