context.vn

SWE Multimodal

1 model evaluated

#  Model            Provider   Type    Score
1  Claude Opus 4.8  Anthropic  Closed  38.4%