| # | Model | Provider | Type | Score | |
|---|---|---|---|---|---|
| 1 | GPT-5.5 | OpenAI | Closed | 83.2% | |
| 2 | GPT-5.4 | OpenAI | Closed | 82.1% | |
| 3 | Kimi K2.6 | Moonshot AI | Open | 80.1% | |
| 4 | GPT-5.4 mini | OpenAI | Closed | 78% | |
| 5 | GPT-5.4 nano | OpenAI | Closed | 69.5% |
| # | Model | Provider | Type | Score | |
|---|---|---|---|---|---|
| 1 | GPT-5.5 | OpenAI | Closed | 83.2% | |
| 2 | GPT-5.4 | OpenAI | Closed | 82.1% | |
| 3 | Kimi K2.6 | Moonshot AI | Open | 80.1% | |
| 4 | GPT-5.4 mini | OpenAI | Closed | 78% | |
| 5 | GPT-5.4 nano | OpenAI | Closed | 69.5% |