context.vn

swe Multilingual

20 models evaluated

#ModelProviderTypeScore
1Claude Opus 4.8AnthropicClosed84.4%
2Composer 2.5CursorClosed79.8%
3Qwen3.7 MaxAlibabaClosed78.3%
4Claude Opus 4.5AnthropicClosed77.5%
5Kimi K2.6Moonshot AIOpen76.7%
6MiniMax M2.7MiniMaxOpen76.5%
7DeepSeek V4 Pro (Max)DeepSeekOpen76.2%
8DeepSeek V4 Pro (High)DeepSeekOpen74.1%
9Qwen3.6 PlusAlibabaClosed73.8%
10Composer 2CursorClosed73.7%
11DeepSeek V4 Flash (Max)DeepSeekOpen73.3%
12GLM-5Z.AIOpen73.3%
13Kimi K2.5Moonshot AIOpen73%
14Qwen3.6-27BAlibabaOpen71.3%
15DeepSeek V4 Flash (High)DeepSeekOpen70.2%
16DeepSeek V4 ProDeepSeekOpen69.8%
17DeepSeek V4 FlashDeepSeekOpen69.7%
18Qwen3.6-35B-A3BAlibabaOpen67.2%
19Laguna M.1PoolsideClosed63.1%
20Laguna XS.2PoolsideOpen57.7%