context.vn

simple Vqa

7 models evaluated

#ModelProviderTypeScore
1Step 3.7 FlashStepFunOpen79.2%
2Gemini 3.1 ProGoogleClosed72.4%
3Muse SparkMetaClosed71.3%
4GPT-5.4OpenAIClosed61.1%
5Qwen3.6-35B-A3BAlibabaOpen58.9%
6Grok 4.20xAIClosed57.4%
7Qwen3.6-27BAlibabaOpen56.1%