context.vn

live Code Bench

14 models evaluated

#ModelProviderTypeScore
1DeepSeek V4 Pro (Max)DeepSeekOpen93.5%
2Qwen3.7 MaxAlibabaClosed91.6%
3DeepSeek V4 Flash (Max)DeepSeekOpen91.6%
4DeepSeek V4 Pro (High)DeepSeekOpen89.8%
5Kimi K2.6Moonshot AIOpen89.6%
6DeepSeek V4 Flash (High)DeepSeekOpen88.4%
7Kimi K2.5Moonshot AIOpen85%
8GLM-4.7Z.AIOpen84.9%
9Qwen3.6-27BAlibabaOpen83.9%
10Qwen3.6-35B-A3BAlibabaOpen80.4%
11Nemotron 3 Nano Omni 30B A3BNVIDIAOpen63.2%
12DeepSeek V4 ProDeepSeekOpen56.8%
13DeepSeek V4 FlashDeepSeekOpen55.2%
14DeepSeek V3DeepSeekOpen37.6%