context.vn

Artificial Analysis

126 models evaluated

| # | Model | Provider | Type | Score |
|---|-------|----------|------|-------|
| 1 | GPT-5.5 | OpenAI | Closed | 60.2% |
| 2 | Claude Opus 4.7 (Adaptive) | Anthropic | Closed | 57.3% |
| 3 | Gemini 3.1 Pro | Google | Closed | 57.2% |
| 4 | GPT-5.4 | OpenAI | Closed | 56.8% |
| 5 | Qwen3.7 Max | Alibaba | Closed | 56.6% |
| 6 | Gemini 3.5 Flash | Google | Closed | 55.3% |
| 7 | Kimi K2.6 | Moonshot AI | Open | 53.9% |
| 8 | MiMo-V2.5-Pro | Xiaomi | Closed | 53.8% |
| 9 | GPT-5.3 Codex | OpenAI | Closed | 53.6% |
| 10 | Grok 4.3 | xAI | Closed | 53.2% |
| 11 | Claude Opus 4.6 (Adaptive) | Anthropic | Closed | 53.0% |
| 12 | Muse Spark | Meta | Closed | 52.1% |
| 13 | Claude Opus 4.7 | Anthropic | Closed | 51.8% |
| 14 | Qwen 3.6 Max (preview) | Alibaba | Closed | 51.8% |
| 15 | DeepSeek V4 Pro (Max) | DeepSeek | Open | 51.5% |
| 16 | GLM-5.1 | Z.AI | Open | 51.4% |
| 17 | GPT-5.2 | OpenAI | Closed | 51.3% |
| 18 | Qwen3.6 Plus | Alibaba | Closed | 50.0% |
| 19 | DeepSeek V4 Pro (High) | DeepSeek | Open | 49.8% |
| 20 | GLM-5 | Z.AI | Open | 49.8% |
| 21 | Claude Opus 4.5 Thinking | Anthropic | Closed | 49.7% |
| 22 | MiniMax M2.7 | MiniMax | Open | 49.6% |
| 23 | MiMo-V2-Pro | Xiaomi | Closed | 49.2% |
| 24 | GPT-5.2-Codex | OpenAI | Closed | 49.0% |
| 25 | GPT-5.4 mini | OpenAI | Closed | 48.9% |
| 26 | Gemini 3 Pro | Google | Closed | 48.4% |
| 27 | GPT-5.1 | OpenAI | Closed | 47.7% |
| 28 | Kimi K2.5 (Reasoning) | Moonshot AI | Closed | 46.8% |
| 29 | Kimi K2.5 | Moonshot AI | Open | 46.8% |
| 30 | GLM-5-Turbo | Z.AI | Closed | 46.8% |
| 31 | DeepSeek V4 Flash (Max) | DeepSeek | Open | 46.5% |
| 32 | Claude Opus 4.6 | Anthropic | Closed | 46.5% |
| 33 | DeepSeek V4 Flash (High) | DeepSeek | Open | 46.0% |
| 34 | Qwen3.6-27B | Alibaba | Open | 45.8% |
| 35 | Qwen3.5 397B (Reasoning) | Alibaba | Open | 45.0% |
| 36 | GPT-5 (high) | OpenAI | Closed | 44.6% |
| 37 | Claude Sonnet 4.6 | Anthropic | Closed | 44.4% |
| 38 | GPT-5.4 nano | OpenAI | Closed | 44.0% |
| 39 | Qwen3.6-35B-A3B | Alibaba | Open | 43.5% |
| 40 | MiMo-V2-Omni | Xiaomi | Closed | 43.4% |
| 41 | GPT-5.1-Codex-Max | OpenAI | Closed | 43.1% |
| 42 | GPT-5.1-Codex | OpenAI | Closed | 43.1% |
| 43 | Claude Opus 4.5 | Anthropic | Closed | 43.1% |
| 44 | GLM-5V-Turbo | Z.AI | Closed | 42.9% |
| 45 | GLM-4.7 | Z.AI | Open | 42.1% |
| 46 | Qwen3.5-27B | Alibaba | Open | 42.1% |
| 47 | GPT-5 (medium) | OpenAI | Closed | 42.0% |
| 48 | Claude 4.1 Opus Thinking | Anthropic | Closed | 42.0% |
| 49 | Hy3 Preview | Tencent | Open | 41.9% |
| 50 | Qwen3.5-122B-A10B | Alibaba | Open | 41.6% |
| 51 | Grok 4 | xAI | Closed | 41.5% |
| 52 | o3-pro | OpenAI | Closed | 40.7% |
| 53 | Qwen3.5 397B | Alibaba | Open | 40.1% |
| 54 | Mistral Medium 3.5 128B | Mistral | Open | 39.2% |
| 55 | Gemma 4 31B | Google | Open | 39.2% |
| 56 | Grok 4.1 Fast (Reasoning) | xAI | Closed | 38.6% |
| 57 | o3 | OpenAI | Closed | 38.4% |
| 58 | Command A+ | Cohere | Open | 37.2% |
| 59 | Qwen3.5-35B-A3B | Alibaba | Open | 37.1% |
| 60 | Claude 4.1 Opus | Anthropic | Closed | 36.0% |
| 61 | Grok 4 Fast (Reasoning) | xAI | Closed | 35.1% |
| 62 | Gemini 3 Flash | Google | Closed | 35.0% |
| 63 | Gemini 2.5 Pro | Google | Closed | 34.6% |
| 64 | Gemini 3.1 Flash-Lite | Google | Closed | 33.5% |
| 65 | GPT-OSS 120B | OpenAI | Open | 33.3% |
| 66 | Claude 4 Sonnet | Anthropic | Closed | 33.0% |
| 67 | K-Exaone | LG AI Research | Closed | 32.1% |
| 68 | DeepSeek V3.2 | DeepSeek | Open | 32.1% |
| 69 | Trinity-Large-Preview | Arcee AI | Open | 31.9% |
| 70 | Trinity-Large-Thinking | Arcee AI | Open | 31.9% |
| 71 | Qwen3 Max | Alibaba | Closed | 31.4% |
| 72 | Gemma 4 26B A4B | Google | Open | 31.2% |
| 73 | o1 | OpenAI | Closed | 30.8% |
| 74 | MiMo-V2-Flash | Xiaomi | Open | 30.4% |
| 75 | GLM-4.6 | Z.AI | Open | 30.2% |
| 76 | Grok Code Fast 1 | xAI | Closed | 28.7% |
| 77 | DeepSeek V3.1 | DeepSeek | Open | 28.1% |
| 78 | Mistral Small 4 (Reasoning) | Mistral | Open | 27.8% |
| 79 | Mistral Small 4 | Mistral | Open | 27.8% |
| 80 | DeepSeek V3.1 (Reasoning) | DeepSeek | Open | 27.7% |
| 81 | DeepSeek-R1 | DeepSeek | Open | 27.1% |
| 82 | Kimi K2 | Moonshot AI | Closed | 26.3% |
| 83 | GPT-4.1 | OpenAI | Closed | 26.3% |
| 84 | Ling 2.6 Flash | InclusionAI | Open | 26.2% |
| 85 | o3-mini | OpenAI | Closed | 25.9% |
| 86 | o1-pro | OpenAI | Closed | 25.8% |
| 87 | GPT-OSS 20B | OpenAI | Open | 24.5% |
| 88 | o1-preview | OpenAI | Closed | 23.7% |
| 89 | Grok 4.1 Fast | xAI | Closed | 23.6% |
| 90 | GLM-4.5-Air | Z.AI | Closed | 23.2% |
| 91 | GPT-4.1 mini | OpenAI | Closed | 22.9% |
| 92 | Mistral Large 3 | Mistral | Closed | 22.8% |
| 93 | Nemotron 3 Nano Omni 30B A3B | NVIDIA | Open | 21.4% |
| 94 | Gemini 2.5 Flash | Google | Closed | 20.6% |
| 95 | Mistral Medium 3 | Mistral | Closed | 18.8% |
| 96 | Gemma 4 E4B | Google | Open | 18.8% |
| 97 | Llama 4 Maverick | Meta | Open | 18.4% |
| 98 | Sarvam 105B | Sarvam | Open | 18.2% |
| 99 | Claude 3 Opus | Anthropic | Closed | 18.0% |
| 100 | Llama 3.1 405B | Meta | Open | 17.4% |
| 101 | GPT-4o | OpenAI | Closed | 17.3% |
| 102 | DeepSeek R1 Distill Qwen 32B | DeepSeek | Open | 17.2% |
| 103 | DeepSeek V3 | DeepSeek | Open | 16.5% |
| 104 | Gemini 1.5 Pro | Google | Closed | 16.0% |
| 105 | Gemma 4 E2B | Google | Open | 15.2% |
| 106 | Mistral Large 2 | Mistral | Closed | 15.1% |
| 107 | Nemotron Ultra 253B | NVIDIA | Open | 15.0% |
| 108 | GPT-4 Turbo | OpenAI | Closed | 13.7% |
| 109 | Solar Pro 2 | Upstage | Closed | 13.6% |
| 110 | Llama 4 Scout | Meta | Open | 13.5% |
| 111 | Nova Pro | Amazon | Closed | 13.5% |
| 112 | Nemotron 3 Nano 30B | NVIDIA | Open | 13.2% |
| 113 | GPT-4.1 nano | OpenAI | Closed | 13.0% |
| 114 | Qwen2.5 Coder 32B Instruct | Alibaba | Open | 12.9% |
| 115 | GPT-4o mini | OpenAI | Closed | 12.7% |
| 116 | Sarvam 30B | Sarvam | Open | 12.3% |
| 117 | Claude 3 Haiku | Anthropic | Closed | 12.3% |
| 118 | Exaone 4.0 32B | LG AI Research | Open | 11.7% |
| 119 | Phi-4 | Microsoft | Open | 10.4% |
| 120 | Gemma 3 27B | Google | Open | 10.3% |
| 121 | Gemini 1.0 Pro | Google | Closed | 8.5% |
| 122 | Exaone 4.0 1.2B | LG AI Research | Open | 8.1% |
| 123 | Granite-4.0-H-1B | IBM | Open | 8.0% |
| 124 | Granite-4.0-1B | IBM | Open | 7.3% |
| 125 | Granite-4.0-350M | IBM | Open | 6.1% |
| 126 | Granite-4.0-H-350M | IBM | Open | 5.4% |