context.vn

SWE-rebench

13 models evaluated

#   Model              Provider     Type    Score
1   Claude Opus 4.6    Anthropic    Closed  65.3%
2   GLM-5              Z.AI         Open    62.8%
3   GLM-5.1            Z.AI         Open    62.7%
4   DeepSeek V3.2      DeepSeek     Open    60.9%
5   Claude Sonnet 4.6  Anthropic    Closed  60.7%
6   Qwen3.5-27B        Alibaba      Open    58.9%
7   GLM-4.7            Z.AI         Open    58.7%
8   Kimi K2.5          Moonshot AI  Open    58.5%
9   GPT-5.3 Codex      OpenAI       Closed  58.2%
10  Composer 2         Cursor       Closed  58%
11  Qwen3.5-35B-A3B    Alibaba      Open    53.7%
12  MiniMax M2.7       MiniMax      Open    51.9%
13  Gemma 4 31B        Google       Open    41.6%