context.vn

corpus Qa1m

6 models evaluated

#ModelProviderTypeScore
1DeepSeek V4 Pro (Max)DeepSeekOpen62.0%
2DeepSeek V4 Flash (Max)DeepSeekOpen60.5%
3DeepSeek V4 Flash (High)DeepSeekOpen59.3%
4DeepSeek V4 Pro (High)DeepSeekOpen56.5%
5DeepSeek V4 ProDeepSeekOpen35.6%
6DeepSeek V4 FlashDeepSeekOpen15.5%