context.vn

program Bench

8 models evaluated

#ModelProviderTypeScore
5Claude Opus 4.6claude-opus-4-6-programbenchAnthropicClosed
7Claude Sonnet 4.6claude-sonnet-4-6-programbenchAnthropicClosed
8GPT-5.4gpt-5-4-programbenchOpenAIClosed
9Gemini 3.1 Progemini-3-1-pro-programbenchGoogleClosed
10Gemini 3 Flashgemini-3-flash-programbenchGoogleClosed
11Claude Haiku 4.5claude-haiku-4-5-programbenchAnthropicClosed
12GPT-5.4 minigpt-5-4-mini-programbenchOpenAIClosed
13GPT-5 minigpt-5-mini-programbenchOpenAIClosed