context.vn

gdpval Aa

113 models evaluated

#ModelProviderTypeScore
1Claude Opus 4.8AnthropicClosed1890
2GPT-5.5OpenAIClosed1769
3Claude Opus 4.7 (Adaptive)AnthropicClosed1753
4Claude Opus 4.7AnthropicClosed1680
5GPT-5.4OpenAIClosed1674
6Gemini 3.5 FlashGoogleClosed1656
7Claude Opus 4.6 (Adaptive)AnthropicClosed1619
8Claude Sonnet 4.6AnthropicClosed1597
9Claude Opus 4.6AnthropicClosed1589
10MiMo-V2.5-ProXiaomiClosed1571
11DeepSeek V4 Pro (High)DeepSeekOpen1558
12DeepSeek V4 Pro (Max)DeepSeekOpen1554
13Qwen3.7 MaxAlibabaClosed1547
14MiniMax M2.7MiniMaxOpen1505
15Qwen 3.6 Max (preview)AlibabaClosed1504
16GLM-5-TurboZ.AIClosed1496
17Grok 4.3xAIClosed1495
18Kimi K2.6Moonshot AIOpen1481
19GPT-5.3 CodexOpenAIClosed1478
20GPT-5.2OpenAIClosed1467
21Claude Opus 4.5 ThinkingAnthropicClosed1450
22GPT-5.4 miniOpenAIClosed1438
23Claude Opus 4.5AnthropicClosed1420
24Muse SparkMetaClosed1417
25DeepSeek V4 Flash (High)DeepSeekOpen1414
26MiMo-V2-ProXiaomiClosed1408
27Qwen3.6-27BAlibabaOpen1404
28GLM-5Z.AIOpen1393
29DeepSeek V4 Flash (Max)DeepSeekOpen1388
30Qwen3.6 PlusAlibabaClosed1351
31GLM-5V-TurboZ.AIClosed1332
32Gemini 3 Pro Deep ThinkGoogleClosed1324
33MiMo-V2-OmniXiaomiClosed1321
34Gemini 3.1 ProGoogleClosed1314
35Qwen3.6-35B-A3BAlibabaOpen1298
36GPT-5 (high)OpenAIClosed1295
37GPT-5.2-CodexOpenAIClosed1289
38Kimi K2.5 (Reasoning)Moonshot AIClosed1286
39Kimi K2.5Moonshot AIOpen1286
40Hy3 PreviewTencentOpen1237
41GPT-5.1OpenAIClosed1227
42Qwen3.5 397BAlibabaOpen1222
43GPT-5.1-Codex-MaxOpenAIClosed1192
44GPT-5.1-CodexOpenAIClosed1192
45GPT-5.4 nanoOpenAIClosed1191
46Qwen3.5 397B (Reasoning)AlibabaOpen1190
47Gemini 3 ProGoogleClosed1186
48GLM-4.7Z.AIOpen1185
49Mistral Medium 3.5 128BMistralOpen1168
50Qwen3.5-27BAlibabaOpen1159
51Claude 4 SonnetAnthropicClosed1128
52Qwen3.5-122B-A10BAlibabaOpen1116
53Gemini 3 FlashGoogleClosed1116
54Gemma 4 31BGoogleOpen1113
55DeepSeek V3.1DeepSeekOpen1078
56MiMo-V2-FlashXiaomiOpen1062
57Grok 4.1 Fast (Reasoning)xAIClosed1044
58Qwen3 MaxAlibabaClosed1040
59Gemma 4 26B A4BGoogleOpen1013
60Grok 4 Fast (Reasoning)xAIClosed1013
61GPT-5 (medium)OpenAIClosed1001
62Grok 4xAIClosed989
63GLM-4.6Z.AIOpen986
64GPT-OSS 120BOpenAIOpen947
65Gemini 3.1 Flash-LiteGoogleClosed925
66Command A+CohereOpen919
67Gemini 2.5 ProGoogleClosed916
68Qwen3.5-35B-A3BAlibabaOpen907
69DeepSeek V3.2DeepSeekOpen876
70Trinity-Large-PreviewArcee AIOpen865
71Trinity-Large-ThinkingArcee AIOpen865
72Mistral Large 3MistralClosed863
73Mistral Small 4 (Reasoning)MistralOpen861
74Mistral Small 4MistralOpen861
75K-ExaoneLG AI ResearchClosed825
76Grok 4.1 FastxAIClosed784
77Ling 2.6 FlashInclusionAIOpen782
78GPT-4.1OpenAIClosed776
79Nemotron 3 Nano Omni 30B A3BNVIDIAOpen765
80Grok Code Fast 1xAIClosed764
81o3OpenAIClosed753
82Gemini 2.5 FlashGoogleClosed741
83Sarvam 105BSarvamOpen739
84o1OpenAIClosed736
85DeepSeek-R1DeepSeekOpen681
86GPT-OSS 20BOpenAIOpen651
87GPT-4.1 miniOpenAIClosed620
88DeepSeek V3.1 (Reasoning)DeepSeekOpen611
89Mistral Medium 3MistralClosed586
90GLM-4.5-AirZ.AIClosed559
91Kimi K2Moonshot AIClosed526
92Solar Pro 2UpstageClosed446
93Llama 4 MaverickMetaOpen436
94DeepSeek V3DeepSeekOpen408
95Nova ProAmazonClosed387
96Claude 3 HaikuAnthropicClosed378
97Sarvam 30BSarvamOpen359
98GPT-4oOpenAIClosed348
99Nemotron 3 Nano 30BNVIDIAOpen348
100Exaone 4.0 32BLG AI ResearchOpen328
101Mistral Large 2MistralClosed323
102GPT-4.1 nanoOpenAIClosed318
103Gemma 4 E4BGoogleOpen302
104Exaone 4.0 1.2BLG AI ResearchOpen295
105Granite-4.0-H-350MIBMOpen292
106Gemma 3 27BGoogleOpen286
107Llama 4 ScoutMetaOpen271
108Gemma 4 E2BGoogleOpen270
109Granite-4.0-H-1BIBMOpen269
110Granite-4.0-350MIBMOpen268
111Granite-4.0-1BIBMOpen258
112Llama 3.1 405BMetaOpen255
113Nemotron Ultra 253BNVIDIAOpen238