context.vn

aa Coding Index

119 models evaluated

#ModelProviderTypeScore
1GPT-5.5OpenAIClosed59.1%
2GPT-5.4OpenAIClosed57.3%
3Gemini 3.1 ProGoogleClosed55.5%
4GPT-5.3 CodexOpenAIClosed53.1%
5Claude Opus 4.7AnthropicClosed53.1%
6Claude Opus 4.7 (Adaptive)AnthropicClosed52.5%
7GPT-5.4 miniOpenAIClosed51.5%
8Qwen3.7 MaxAlibabaClosed50.1%
9GPT-5.2OpenAIClosed48.7%
10Claude Opus 4.6 (Adaptive)AnthropicClosed48.1%
11Claude Opus 4.5 ThinkingAnthropicClosed47.8%
12Claude Opus 4.6AnthropicClosed47.6%
13DeepSeek V4 Pro (Max)DeepSeekOpen47.5%
14Muse SparkMetaClosed47.5%
15Kimi K2.6Moonshot AIOpen47.1%
16Gemini 3 ProGoogleClosed46.5%
17Claude Sonnet 4.6AnthropicClosed46.4%
18MiMo-V2.5-ProXiaomiClosed45.5%
19Gemini 3.5 FlashGoogleClosed45.0%
20Qwen 3.6 Max (preview)AlibabaClosed44.9%
21GPT-5.1OpenAIClosed44.7%
22GLM-5Z.AIOpen44.2%
23GPT-5.4 nanoOpenAIClosed43.9%
24GLM-5.1Z.AIOpen43.4%
25DeepSeek V4 Pro (High)DeepSeekOpen43.3%
26GPT-5.2-CodexOpenAIClosed43.0%
27Claude Opus 4.5AnthropicClosed42.9%
28Qwen3.6 PlusAlibabaClosed42.9%
29MiniMax M2.7MiniMaxOpen41.9%
30MiMo-V2-ProXiaomiClosed41.4%
31Qwen3.5 397B (Reasoning)AlibabaOpen41.3%
32Grok 4.3xAIClosed41.0%
33Grok 4xAIClosed40.5%
34DeepSeek V4 Flash (High)DeepSeekOpen39.8%
35Kimi K2.5 (Reasoning)Moonshot AIClosed39.5%
36Kimi K2.5Moonshot AIOpen39.5%
37GPT-5 (medium)OpenAIClosed39.0%
38DeepSeek V4 Flash (Max)DeepSeekOpen38.7%
39Gemma 4 31BGoogleOpen38.7%
40o3OpenAIClosed38.4%
41Gemini 3 FlashGoogleClosed37.8%
42Qwen3.5 397BAlibabaOpen37.4%
43GLM-5-TurboZ.AIClosed36.8%
44GPT-5.1-Codex-MaxOpenAIClosed36.6%
45GPT-5.1-CodexOpenAIClosed36.6%
46Claude 4.1 Opus ThinkingAnthropicClosed36.5%
47Qwen3.6-27BAlibabaOpen36.5%
48Hy3 PreviewTencentOpen36.5%
49GLM-4.7Z.AIOpen36.3%
50GLM-5V-TurboZ.AIClosed36.2%
51GPT-5 (high)OpenAIClosed36.0%
52MiMo-V2-OmniXiaomiClosed35.5%
53Mistral Medium 3.5 128BMistralOpen35.4%
54Qwen3.6-35B-A3BAlibabaOpen35.1%
55Qwen3.5-27BAlibabaOpen34.9%
56Qwen3.5-122B-A10BAlibabaOpen34.7%
57DeepSeek V3.2DeepSeekOpen34.6%
58o1-previewOpenAIClosed34.0%
59Gemini 2.5 ProGoogleClosed31.9%
60Grok 4.1 Fast (Reasoning)xAIClosed30.9%
61Claude 4 SonnetAnthropicClosed30.6%
62Qwen3.5-35B-A3BAlibabaOpen30.3%
63GLM-4.6Z.AIOpen30.2%
64Gemini 3.1 Flash-LiteGoogleClosed30.1%
65DeepSeek V3.1 (Reasoning)DeepSeekOpen29.7%
66Command A+CohereOpen29.3%
67GPT-OSS 120BOpenAIOpen28.6%
68DeepSeek V3.1DeepSeekOpen28.4%
69Grok 4 Fast (Reasoning)xAIClosed27.4%
70Trinity-Large-PreviewArcee AIOpen27.2%
71Trinity-Large-ThinkingArcee AIOpen27.2%
72K-ExaoneLG AI ResearchClosed27.0%
73Qwen3 MaxAlibabaClosed26.4%
74MiMo-V2-FlashXiaomiOpen25.8%
75Mistral Small 4 (Reasoning)MistralOpen24.3%
76Mistral Small 4MistralOpen24.3%
77DeepSeek-R1DeepSeekOpen24.0%
78GLM-4.5-AirZ.AIClosed23.8%
79Grok Code Fast 1xAIClosed23.7%
80Gemini 1.5 ProGoogleClosed23.6%
81Ling 2.6 FlashInclusionAIOpen23.2%
82Mistral Large 3MistralClosed22.7%
83Gemma 4 26B A4BGoogleOpen22.4%
84Kimi K2Moonshot AIClosed22.1%
85GPT-4.1OpenAIClosed21.8%
86GPT-4 TurboOpenAIClosed21.5%
87o1OpenAIClosed20.5%
88Claude 3 OpusAnthropicClosed19.5%
89Grok 4.1 FastxAIClosed19.5%
90GPT-OSS 20BOpenAIOpen18.5%
91GPT-4.1 miniOpenAIClosed18.5%
92o3-miniOpenAIClosed17.9%
93Gemini 2.5 FlashGoogleClosed17.8%
94GPT-4oOpenAIClosed16.7%
95DeepSeek V3DeepSeekOpen16.4%
96Nemotron 3 Nano 30BNVIDIAOpen15.8%
97Llama 4 MaverickMetaOpen15.6%
98Nemotron 3 Nano Omni 30B A3BNVIDIAOpen14.8%
99Llama 3.1 405BMetaOpen14.5%
100Mistral Large 2MistralClosed13.8%
101Gemma 4 E4BGoogleOpen13.7%
102Mistral Medium 3MistralClosed13.6%
103Nemotron Ultra 253BNVIDIAOpen13.1%
104Solar Pro 2UpstageClosed11.3%
105Phi-4MicrosoftOpen11.2%
106GPT-4.1 nanoOpenAIClosed11.2%
107Nova ProAmazonClosed11.0%
108Sarvam 105BSarvamOpen9.8%
109Gemma 3 27BGoogleOpen9.6%
110Exaone 4.0 32BLG AI ResearchOpen9.4%
111Gemma 4 E2BGoogleOpen9.0%
112Sarvam 30BSarvamOpen7.9%
113Claude 3 HaikuAnthropicClosed6.7%
114Llama 4 ScoutMetaOpen6.7%
115Granite-4.0-1BIBMOpen2.9%
116Granite-4.0-H-1BIBMOpen2.7%
117Exaone 4.0 1.2BLG AI ResearchOpen2.5%
118Granite-4.0-H-350MIBMOpen0.6%
119Granite-4.0-350MIBMOpen0.3%