context.vn

multi Swe Bench

1 models evaluated

#ModelProviderTypeScore
1MiniMax M2.7MiniMaxOpen52.7%