context.vn

deep Planning

6 models evaluated

#ModelProviderTypeScore
1Qwen3.6 PlusAlibabaClosed41.5%
2Qwen3.5 397BAlibabaOpen37.6%
3Claude Opus 4.5AnthropicClosed26.4%
4Qwen3.6-35B-A3BAlibabaOpen25.9%
5GLM-5Z.AIOpen14.6%
6Kimi K2.5Moonshot AIOpen14.4%