context.vn

inference Bench

14 models evaluated

#ModelProviderTypeScore
1Claude Sonnet 4.6AnthropicClosed8.08x
2GLM-5Z.AIOpen6.20x
3Gemini 3.1 ProGoogleClosed6.16x
4GPT-5.3 Codex (High)OpenAIClosed5.48x
5GPT-5.4 (High)OpenAIClosed5.08x
7GPT-5.5 (High)OpenAIClosed4.22x
8Claude Opus 4.6AnthropicClosed3.89x
9GPT-5.2OpenAIClosed3.82x
10GPT-5.1 Codex MaxOpenAIClosed3.54x
11Claude Opus 4.5AnthropicClosed3.37x
12Claude Sonnet 4.5AnthropicClosed2.96x
13Claude Opus 4.7AnthropicClosed2.25x
14GPT-5.2 CodexOpenAIClosed1.55x
15Claude Haiku 4.5AnthropicClosed1.24x