context.vn

mmmu Pro Python

5 models evaluated

#ModelProviderTypeScore
1GPT-5.5OpenAIClosed83.2%
2GPT-5.4OpenAIClosed82.1%
3Kimi K2.6Moonshot AIOpen80.1%
4GPT-5.4 miniOpenAIClosed78%
5GPT-5.4 nanoOpenAIClosed69.5%