context.vn

mm Answer Bench

9 models evaluated

#  Model            Provider     Type    Score
1  Kimi K2.6        Moonshot AI  Open    86.0%
2  Claude Opus 4.5  Anthropic    Closed  84.0%
3  GLM-5.1          Z.AI         Open    83.8%
4  Qwen3.6 Plus     Alibaba      Closed  83.8%
5  GLM-5            Z.AI         Open    82.5%
6  Kimi K2.5        Moonshot AI  Open    81.8%
7  Qwen3.5 397B     Alibaba      Open    80.9%
8  Qwen3.6-27B      Alibaba      Open    80.8%
9  Qwen3.6-35B-A3B  Alibaba      Open    78.9%