context.vn

lcr

115 models evaluated

#ModelProviderTypeScore
1GPT-5.2-CodexOpenAIClosed75.7%
2GPT-5 (high)OpenAIClosed75.6%
3GPT-5.1OpenAIClosed75.0%
4GPT-5.5OpenAIClosed74.3%
5GPT-5.4OpenAIClosed74.0%
6GPT-5.3 CodexOpenAIClosed74.0%
7Claude Opus 4.5 ThinkingAnthropicClosed74.0%
8MiMo-V2.5-ProXiaomiClosed73.3%
9GPT-5 (medium)OpenAIClosed72.8%
10Gemini 3.1 ProGoogleClosed72.7%
11GPT-5.2OpenAIClosed72.7%
12Gemini 3 ProGoogleClosed70.7%
13Claude Opus 4.6 (Adaptive)AnthropicClosed70.7%
14Claude Opus 4.7 (Adaptive)AnthropicClosed70.3%
15Kimi K2.6Moonshot AIOpen69.7%
16Qwen3.6 PlusAlibabaClosed69.7%
17Muse SparkMetaClosed69.7%
18Qwen 3.6 Max (preview)AlibabaClosed69.7%
19Gemini 3.5 FlashGoogleClosed69.3%
20o3OpenAIClosed69.3%
21GPT-5.4 miniOpenAIClosed69.3%
22Qwen3.7 MaxAlibabaClosed69.0%
23Qwen3.6-27BAlibabaOpen68.7%
24MiniMax M2.7MiniMaxOpen68.7%
25Grok 4xAIClosed68.0%
26Grok 4.1 Fast (Reasoning)xAIClosed68.0%
27GPT-5.1-Codex-MaxOpenAIClosed67.3%
28Qwen3.5-27BAlibabaOpen67.3%
29GPT-5.1-CodexOpenAIClosed67.3%
30Claude Opus 4.7AnthropicClosed67.0%
31Qwen3.5-122B-A10BAlibabaOpen66.7%
32MiMo-V2-OmniXiaomiClosed66.7%
33DeepSeek V4 Pro (Max)DeepSeekOpen66.3%
34Claude 4.1 Opus ThinkingAnthropicClosed66.3%
35Gemini 2.5 ProGoogleClosed66.0%
36GPT-5.4 nanoOpenAIClosed66.0%
37Qwen3.5 397B (Reasoning)AlibabaOpen65.7%
38Claude Opus 4.5AnthropicClosed65.3%
39Kimi K2.5 (Reasoning)Moonshot AIClosed65.3%
40Kimi K2.5Moonshot AIOpen65.3%
41Gemini 3.1 Flash-LiteGoogleClosed65.3%
42DeepSeek V4 Pro (High)DeepSeekOpen65.0%
43Grok 4 Fast (Reasoning)xAIClosed64.7%
44Grok 4.3xAIClosed64.3%
45GLM-4.7Z.AIOpen64.0%
46Qwen3.6-35B-A3BAlibabaOpen63.7%
47GLM-5Z.AIOpen63.3%
48DeepSeek V4 Flash (Max)DeepSeekOpen63.0%
49DeepSeek V4 Flash (High)DeepSeekOpen62.7%
50Qwen3.5-35B-A3BAlibabaOpen62.7%
51GLM-5.1Z.AIOpen62.3%
52Gemma 4 31BGoogleOpen62.0%
53GPT-4.1OpenAIClosed61.0%
54Mistral Medium 3.5 128BMistralOpen61.0%
55GLM-5V-TurboZ.AIClosed61.0%
56MiMo-V2-ProXiaomiClosed60.7%
57GLM-5-TurboZ.AIClosed60.7%
58o1OpenAIClosed59.3%
59Claude Opus 4.6AnthropicClosed58.3%
60Qwen3.5 397BAlibabaOpen58.0%
61Claude Sonnet 4.6AnthropicClosed57.7%
62Gemma 4 26B A4BGoogleOpen55.7%
63K-ExaoneLG AI ResearchClosed55.7%
64DeepSeek-R1DeepSeekOpen54.7%
65Hy3 PreviewTencentOpen54.7%
66DeepSeek V3.1 (Reasoning)DeepSeekOpen53.3%
67Kimi K2Moonshot AIClosed51.0%
68GPT-OSS 120BOpenAIOpen50.7%
69Grok Code Fast 1xAIClosed48.3%
70Gemini 3 FlashGoogleClosed48.0%
71Qwen3 MaxAlibabaClosed46.7%
72Llama 4 MaverickMetaOpen46.0%
73Command A+CohereOpen46.0%
74Gemini 2.5 FlashGoogleClosed45.9%
75DeepSeek V3.1DeepSeekOpen45.0%
76Mistral Small 4 (Reasoning)MistralOpen44.7%
77Mistral Small 4MistralOpen44.7%
78Claude 4 SonnetAnthropicClosed44.3%
79GLM-4.5-AirZ.AIClosed43.7%
80GPT-4.1 miniOpenAIClosed42.3%
81DeepSeek V3.2DeepSeekOpen39.0%
82Nemotron 3 Nano Omni 30B A3BNVIDIAOpen35.7%
83Mistral Large 3MistralClosed34.7%
84Trinity-Large-PreviewArcee AIOpen33.0%
85Trinity-Large-ThinkingArcee AIOpen33.0%
86MiMo-V2-FlashXiaomiOpen31.3%
87GPT-OSS 20BOpenAIOpen30.7%
88Gemma 4 E4BGoogleOpen30.7%
89DeepSeek V3DeepSeekOpen29.0%
90Mistral Medium 3MistralClosed28.0%
91GLM-4.6Z.AIOpen26.3%
92Llama 4 ScoutMetaOpen25.8%
93Ling 2.6 FlashInclusionAIOpen25.0%
94Llama 3.1 405BMetaOpen24.3%
95Grok 4.1 FastxAIClosed22.0%
96Claude 3 HaikuAnthropicClosed21.0%
97Nova ProAmazonClosed19.0%
98GPT-4.1 nanoOpenAIClosed17.0%
99Gemma 4 E2BGoogleOpen15.0%
100DeepSeek R1 Distill Qwen 32BDeepSeekOpen9.7%
101Exaone 4.0 32BLG AI ResearchOpen8.0%
102Nemotron Ultra 253BNVIDIAOpen7.3%
103Nemotron 3 Nano 30BNVIDIAOpen6.7%
104Granite-4.0-H-1BIBMOpen6.3%
105Gemma 3 27BGoogleOpen5.7%
106Mistral Large 2MistralClosed5.3%
107Granite-4.0-1BIBMOpen4.0%
108GPT-4oOpenAIClosed0.0%
109Sarvam 105BSarvamOpen0.0%
110Phi-4MicrosoftOpen0.0%
111Sarvam 30BSarvamOpen0.0%
112Solar Pro 2UpstageClosed0.0%
113Exaone 4.0 1.2BLG AI ResearchOpen0.0%
114Granite-4.0-350MIBMOpen0.0%
115Granite-4.0-H-350MIBMOpen0.0%