context.vn

claw Eval

22 models evaluated

#ModelProviderTypeScore
1Claude Opus 4.6AnthropicClosedopus46
2Claude Sonnet 4.6AnthropicClosedsonnet46
3MiMo-V2.5-ProXiaomiClosedmimo_v25_pro
4Muse Sparkmuse_sparkMetaClosed
5Kimi K2.6kimi_k26Moonshot AIOpen
6MiMo-V2.5mimo_v25XiaomiClosed
7GLM-5.1glm51Z.AIOpen
8GPT-5.4gpt54OpenAIClosed
9DeepSeek V4 Prodeepseek_v4_proDeepSeekOpen
10Qwen3.6 Plusqwen3.6_plusAlibabaClosed
11Gemini 3.1 Progemini31_proGoogleClosed
12DeepSeek V4 Flashdeepseek_v4_flashDeepSeekOpen
13MiMo-V2-Promimo_v2_proXiaomiClosed
14Qwen3.5 397Bqwen3.5-397b-a17bAlibabaOpen
15GLM-5-Turboglm5_turboZ.AIClosed
16GLM-5V-Turboglm5v_turboZ.AIClosed
17Kimi K2.5kimi_k25Moonshot AIOpen
19Gemini 3 Flashgemini3_flashGoogleClosed
20MiniMax M2.7minimax_m27MiniMaxOpen
21MiMo-V2-Omnimimo_v2_omniXiaomiClosed
22DeepSeek V3.2deepseek_v32DeepSeekOpen
23Nemotron 3 Super 100Bnemotron3_superNVIDIAOpen