context.vn

blueprint Bench2

1 models evaluated

#ModelProviderTypeScore
1Gemini 3.5 FlashGoogleClosed33.6%