context.vn

big Code Bench

2 models evaluated

#ModelProviderTypeScore
1DeepSeek V4 Pro BaseDeepSeekOpen59.2%
2DeepSeek V4 Flash BaseDeepSeekOpen56.8%