context.vn

zero Bench

3 models evaluated

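| # | Model | Provider | Type | Score |
|---|-------|----------|------|-------|
| 1 | GPT-5.4 | OpenAI | Closed | 41.0% |
| 2 | Muse Spark | Meta | Closed | 33.0% |
| 3 | Gemini 3.1 Pro | Google | Closed | 29.0% |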