context.vn

MMLU-ProX

10 models evaluated

#   Model              Provider     Type    Score
1   Qwen3.7 Max        Alibaba      Closed  87%
2   Claude Opus 4.5    Anthropic    Closed  85.7%
3   Qwen3.6 Plus       Alibaba      Closed  84.7%
4   Qwen3.5 397B       Alibaba      Open    84.7%
5   GLM-5              Z.AI         Open    83.1%
6   Kimi K2.5          Moonshot AI  Open    82.3%
7   Qwen3.5-122B-A10B  Alibaba      Open    82.2%
8   Qwen3.5-27B        Alibaba      Open    82.2%
9   Qwen3.5-35B-A3B    Alibaba      Open    81%
10  Qwen3 235B 2507    Alibaba      Open    79.4%