context.vn

MMMLU

4 models evaluated

#  Model                   Provider   Type    Score
1  Interfaze Beta          Interfaze  Closed  90.9%
2  Qwen3.7 Max             Alibaba    Closed  90.3%
3  DeepSeek V4 Pro Base    DeepSeek   Open    90.3%
4  DeepSeek V4 Flash Base  DeepSeek   Open    88.8%