context.vn

ARC-AGI-2

11 models evaluated

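| # | Model | Provider | Type | Score |
|---|-------|----------|------|-------|
| 1 | GPT-5.5 | OpenAI | Closed | 85% |
| 2 | GPT-5.4 Pro | OpenAI | Closed | 83.3% |
| 3 | Gemini 3.1 Pro | Google | Closed | 77.1% |
| 4 | Claude Opus 4.7 (Adaptive) | Anthropic | Closed | 75.8% |
| 5 | Gemini 3.5 Flash | Google | Closed | 72.1% |
| 6 | Grok 4.20 | xAI | Closed | 53.3% |
| 7 | GPT-5.2 | OpenAI | Closed | 52.9% |
| 8 | Gemini 3 Pro Deep Think | Google | Closed | 45.1% |
| 9 | Muse Spark | Meta | Closed | 42.5% |
| 10 | Gemini 3 Pro | Google | Closed | 31.1% |
| 11 | Claude Sonnet 4.5 | Anthropic | Closed | 13.6% |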