context.vn

DeepSearchQA

9 models evaluated

#  Model            Provider     Type    Score
1  Claude Opus 4.8  Anthropic    Closed  93.1%
2  Step 3.7 Flash   StepFun      Open    92.8%
3  Kimi K2.6        Moonshot AI  Open    92.5%
4  Kimi K2.5        Moonshot AI  Open    77.1%
5  Muse Spark       Meta         Closed  74.8%
6  Claude Opus 4.6  Anthropic    Closed  73.7%
7  GPT-5.4          OpenAI       Closed  73.6%
8  Gemini 3.1 Pro   Google       Closed  69.7%
9  Grok 4.20        xAI          Closed  62.8%