context.vn

IFEval

19 models evaluated

#   Model              Provider     Type    Score
1   Qwen3.5-27B        Alibaba      Open    95%
2   Qwen3.7 Max        Alibaba      Closed  94.3%
3   Qwen3.6 Plus       Alibaba      Closed  94.3%
4   Kimi K2.5          Moonshot AI  Open    93.9%
5   o3-mini            OpenAI       Closed  93.9%
6   Qwen3.5-122B-A10B  Alibaba      Open    93.4%
7   GLM-5              Z.AI         Open    92.6%
8   Qwen3.5 397B       Alibaba      Open    92.6%
9   o1                 OpenAI       Closed  92.2%
10  Qwen3.5-35B-A3B    Alibaba      Open    91.9%
11  LFM2.5-8B-A1B      LiquidAI     Open    91.8%
12  Claude Opus 4.5    Anthropic    Closed  90.9%
13  GPT-4.1 mini       OpenAI       Closed  88.5%
14  GPT-4.1            OpenAI       Closed  87.4%
15  DeepSeek V3        DeepSeek     Open    86.1%
16  ZAYA1-8B           Zyphra       Open    85.6%
17  GPT-4.1 nano       OpenAI       Closed  83.2%
18  MiniCPM5-1B        OpenBMB      Open    80.4%
19  LFM2.5-VL-450M     LiquidAI     Open    61.2%