context.vn

os World Verified

21 models evaluated

#ModelProviderTypeScore
1Claude Opus 4.8AnthropicClosed83.4%
2Holo3-35B-A3BH CompanyOpen82.6%
3Claude Mythos PreviewAnthropicClosed79.6%
4Holo3-122B-A10BH CompanyClosed78.8%
5GPT-5.5OpenAIClosed78.7%
6Gemini 3.5 FlashGoogleClosed78.4%
7Claude Opus 4.7 (Adaptive)AnthropicClosed78%
8GPT-5.4OpenAIClosed75%
9Kimi K2.6Moonshot AIOpen73.1%
10Claude Opus 4.6AnthropicClosed72.7%
11Claude Sonnet 4.6AnthropicClosed72.1%
12GPT-5.4 miniOpenAIClosed72.1%
13MiniMax M3MiniMaxOpen70.1%
14Claude Opus 4.5AnthropicClosed66.3%
15GPT-5.3 CodexOpenAIClosed64.7%
16Claude Sonnet 4.5AnthropicClosed61.4%
17Qwen3.5-122B-A10BAlibabaOpen58%
18Qwen3.5-27BAlibabaOpen56.2%
19Qwen3.5-35B-A3BAlibabaOpen54.5%
20GPT-5.2OpenAIClosed47.3%
21GPT-5.4 nanoOpenAIClosed39%