context.vn

cmmlu

2 models evaluated

#ModelProviderTypeScore
1DeepSeek V4 Pro BaseDeepSeekOpen90.8%
2DeepSeek V4 Flash BaseDeepSeekOpen90.4%