context.vn

Omniscience Accuracy

114 models evaluated

| # | Model | Provider | Type | Score |
|---|-------|----------|------|-------|
| 1 | GPT-5.5 | OpenAI | Closed | 56.9% |
| 2 | Gemini 3 Pro | Google | Closed | 55.9% |
| 3 | Gemini 3.1 Pro | Google | Closed | 55.3% |
| 4 | Gemini 3.5 Flash | Google | Closed | 51.9% |
| 5 | GPT-5.3 Codex | OpenAI | Closed | 51.8% |
| 6 | GPT-5.4 | OpenAI | Closed | 50.0% |
| 7 | Claude Opus 4.6 (Adaptive) | Anthropic | Closed | 46.4% |
| 8 | Claude Opus 4.7 (Adaptive) | Anthropic | Closed | 45.8% |
| 9 | Claude Opus 4.5 Thinking | Anthropic | Closed | 45.7% |
| 10 | Gemini 3 Flash | Google | Closed | 45.5% |
| 11 | Claude Opus 4.6 | Anthropic | Closed | 45.2% |
| 12 | Muse Spark | Meta | Closed | 44.6% |
| 13 | GPT-5.2 | OpenAI | Closed | 43.8% |
| 14 | Claude Opus 4.7 | Anthropic | Closed | 43.5% |
| 15 | DeepSeek V4 Pro (Max) | DeepSeek | Open | 43.3% |
| 16 | DeepSeek V4 Pro (High) | DeepSeek | Open | 41.8% |
| 17 | Grok 4 | xAI | Closed | 41.4% |
| 18 | GPT-5 (high) | OpenAI | Closed | 40.7% |
| 19 | GPT-5.2-Codex | OpenAI | Closed | 40.7% |
| 20 | Claude Opus 4.5 | Anthropic | Closed | 40.7% |
| 21 | GPT-5.1-Codex-Max | OpenAI | Closed | 39.2% |
| 22 | GPT-5.1-Codex | OpenAI | Closed | 39.2% |
| 23 | Gemini 2.5 Pro | Google | Closed | 39.0% |
| 24 | GPT-5 (medium) | OpenAI | Closed | 38.9% |
| 25 | o3 | OpenAI | Closed | 38.4% |
| 26 | Claude Sonnet 4.6 | Anthropic | Closed | 38.0% |
| 27 | Qwen 3.6 Max (preview) | Alibaba | Closed | 37.7% |
| 28 | GPT-5.1 | OpenAI | Closed | 37.6% |
| 29 | GPT-5.4 mini | OpenAI | Closed | 37.5% |
| 30 | DeepSeek V4 Flash (Max) | DeepSeek | Open | 37.2% |
| 31 | Gemini 3.1 Flash-Lite | Google | Closed | 36.4% |
| 32 | DeepSeek V4 Flash (High) | DeepSeek | Open | 35.5% |
| 33 | o1 | OpenAI | Closed | 34.7% |
| 34 | Grok 4.3 | xAI | Closed | 34.6% |
| 35 | Kimi K2.5 (Reasoning) | Moonshot AI | Closed | 34.3% |
| 36 | Kimi K2.5 | Moonshot AI | Open | 34.3% |
| 37 | Kimi K2.6 | Moonshot AI | Open | 32.8% |
| 38 | Qwen3.5 397B (Reasoning) | Alibaba | Open | 31.4% |
| 39 | DeepSeek-R1 | DeepSeek | Open | 31.0% |
| 40 | Qwen3.7 Max | Alibaba | Closed | 30.1% |
| 41 | GLM-4.7 | Z.AI | Open | 29.3% |
| 42 | GLM-5V-Turbo | Z.AI | Closed | 29.1% |
| 43 | GLM-5-Turbo | Z.AI | Closed | 29.0% |
| 44 | DeepSeek V3.1 (Reasoning) | DeepSeek | Open | 28.8% |
| 45 | Hy3 Preview | Tencent | Open | 28.0% |
| 46 | GLM-5 | Z.AI | Open | 26.9% |
| 47 | Kimi K2 | Moonshot AI | Closed | 26.8% |
| 48 | MiMo-V2-Pro | Xiaomi | Closed | 26.8% |
| 49 | Gemini 2.5 Flash | Google | Closed | 26.5% |
| 50 | Qwen3.6 Plus | Alibaba | Closed | 26.2% |
| 51 | MiniMax M2.7 | MiniMax | Open | 26.1% |
| 52 | DeepSeek V3 | DeepSeek | Open | 25.4% |
| 53 | GPT-5.4 nano | OpenAI | Closed | 25.4% |
| 54 | Grok 4.1 Fast (Reasoning) | xAI | Closed | 25.3% |
| 55 | Mistral Medium 3.5 128B | Mistral | Open | 25.1% |
| 56 | Qwen3.5-122B-A10B | Alibaba | Open | 24.7% |
| 57 | Qwen3 Max | Alibaba | Closed | 24.4% |
| 58 | Qwen3.5 397B | Alibaba | Open | 24.3% |
| 59 | Llama 4 Maverick | Meta | Open | 24.3% |
| 60 | GLM-5.1 | Z.AI | Open | 24.2% |
| 61 | DeepSeek V3.2 | DeepSeek | Open | 24.2% |
| 62 | GPT-4.1 | OpenAI | Closed | 24.2% |
| 63 | Mistral Large 3 | Mistral | Closed | 24.1% |
| 64 | Grok Code Fast 1 | xAI | Closed | 23.8% |
| 65 | DeepSeek V3.1 | DeepSeek | Open | 23.1% |
| 66 | Trinity-Large-Preview | Arcee AI | Open | 22.8% |
| 67 | Trinity-Large-Thinking | Arcee AI | Open | 22.8% |
| 68 | MiMo-V2.5-Pro | Xiaomi | Closed | 22.6% |
| 69 | Grok 4 Fast (Reasoning) | xAI | Closed | 22.6% |
| 70 | Claude 4 Sonnet | Anthropic | Closed | 22.4% |
| 71 | Llama 3.1 405B | Meta | Open | 22.3% |
| 72 | Mistral Small 4 (Reasoning) | Mistral | Open | 22.1% |
| 73 | Mistral Small 4 | Mistral | Open | 22.1% |
| 74 | GPT-OSS 120B | OpenAI | Open | 21.5% |
| 75 | Qwen3.5-27B | Alibaba | Open | 21.0% |
| 76 | GLM-4.6 | Z.AI | Open | 20.8% |
| 77 | Qwen3.5-35B-A3B | Alibaba | Open | 20.5% |
| 78 | Mistral Large 2 | Mistral | Closed | 20.1% |
| 79 | Nemotron Ultra 253B | NVIDIA | Open | 19.9% |
| 80 | Gemma 4 31B | Google | Open | 19.9% |
| 81 | GPT-4o | OpenAI | Closed | 19.7% |
| 82 | Qwen3.6-27B | Alibaba | Open | 19.2% |
| 83 | Qwen3.6-35B-A3B | Alibaba | Open | 18.9% |
| 84 | MiMo-V2-Omni | Xiaomi | Closed | 18.7% |
| 85 | Mistral Medium 3 | Mistral | Closed | 18.3% |
| 86 | Gemma 4 26B A4B | Google | Open | 18.2% |
| 87 | Sarvam 105B | Sarvam | Open | 17.6% |
| 88 | GPT-4.1 mini | OpenAI | Closed | 17.5% |
| 89 | Claude 3 Haiku | Anthropic | Closed | 17.2% |
| 90 | Grok 4.1 Fast | xAI | Closed | 17.0% |
| 91 | Nova Pro | Amazon | Closed | 17.0% |
| 92 | K-Exaone | LG AI Research | Closed | 16.5% |
| 93 | Solar Pro 2 | Upstage | Closed | 15.6% |
| 94 | GLM-4.5-Air | Z.AI | Closed | 15.5% |
| 95 | GPT-OSS 20B | OpenAI | Open | 15.5% |
| 96 | Ling 2.6 Flash | InclusionAI | Open | 15.4% |
| 97 | MiMo-V2-Flash | Xiaomi | Open | 15.2% |
| 98 | Nemotron 3 Nano Omni 30B A3B | NVIDIA | Open | 14.8% |
| 99 | Llama 4 Scout | Meta | Open | 14.6% |
| 100 | GPT-4.1 nano | OpenAI | Closed | 13.3% |
| 101 | Phi-4 | Microsoft | Open | 13.2% |
| 102 | Sarvam 30B | Sarvam | Open | 12.7% |
| 103 | Gemma 3 27B | Google | Open | 12.5% |
| 104 | Nemotron 3 Nano 30B | NVIDIA | Open | 11.4% |
| 105 | Exaone 4.0 32B | LG AI Research | Open | 10.4% |
| 106 | Command A+ | Cohere | Open | 8.9% |
| 107 | LFM2.5-8B-A1B | LiquidAI | Open | 8.7% |
| 108 | Gemma 4 E4B | Google | Open | 8.6% |
| 109 | Gemma 4 E2B | Google | Open | 6.7% |
| 110 | Granite-4.0-1B | IBM | Open | 6.1% |
| 111 | Granite-4.0-H-1B | IBM | Open | 5.3% |
| 112 | Exaone 4.0 1.2B | LG AI Research | Open | 4.7% |
| 113 | Granite-4.0-H-350M | IBM | Open | 3.7% |
| 114 | Granite-4.0-350M | IBM | Open | 3.2% |