context.vn

HumanEval

2 models evaluated

#  Model                   Provider  Type  Score
1  DeepSeek V4 Pro Base    DeepSeek  Open  76.8%
2  DeepSeek V4 Flash Base  DeepSeek  Open  69.5%