context.vn

math Benchmark

2 models evaluated

#ModelProviderTypeScore
1DeepSeek V4 Pro BaseDeepSeekOpen64.5%
2DeepSeek V4 Flash BaseDeepSeekOpen57.4%