context.vn

cursor Bench31

7 models evaluated

#ModelProviderTypeScore
1Claude Opus 4.7 (Adaptive)AnthropicClosed64.8%
2Composer 2.5CursorClosed63.2%
3GPT-5.5OpenAIClosed59.2%
4Composer 2CursorClosed52.2%
5Gemini 3.5 FlashGoogleClosed49.8%
6Kimi K2.6Moonshot AIOpen47.6%
7Kimi K2.5Moonshot AIOpen31.9%