context.vn

charxiv No Tools

3 models evaluated

#ModelProviderTypeScore
1Claude Mythos PreviewAnthropicClosed86.1%
2Claude Opus 4.7 (Adaptive)AnthropicClosed82.1%
3Claude Opus 4.8AnthropicClosed80.5%