context.vn

apex Agents Aa

20 models evaluated

#ModelProviderTypeScore
1Gemini 3.5 FlashGoogleClosed47.1%
2GPT-5.5OpenAIClosed37.7%
3GPT-5.4OpenAIClosed33.3%
4Claude Opus 4.6 (Adaptive)AnthropicClosed33.0%
5Gemini 3.1 ProGoogleClosed32.0%
6Kimi K2.6Moonshot AIOpen28.5%
7GPT-5.4 miniOpenAIClosed28.2%
8MiniMax M3MiniMaxOpen27.7%
9GPT-5.4 nanoOpenAIClosed24.9%
10DeepSeek V4 Pro (Max)DeepSeekOpen24.3%
11Grok 4.3xAIClosed17.0%
12Qwen3.5 397B (Reasoning)AlibabaOpen15.3%
13GLM-5Z.AIOpen14.5%
14Gemini 3.1 Flash-LiteGoogleClosed12.2%
15Kimi K2.5 (Reasoning)Moonshot AIClosed11.5%
16Kimi K2.5Moonshot AIOpen11.5%
17MiniMax M2.7MiniMaxOpen10.6%
18GPT-OSS 120BOpenAIOpen3.1%
19MiMo-V2.5-ProXiaomiClosed2.4%
20GPT-OSS 20BOpenAIOpen0.7%