context.vn

gsm8k

2 models evaluated

#ModelProviderTypeScore
1DeepSeek V4 Pro BaseDeepSeekOpen92.6%
2DeepSeek V4 Flash BaseDeepSeekOpen90.8%