APEX
Back to models

Minimax M2.7 [NVFP4]

SGLang

197K context<$0.01/M input<$0.01/M output
1649peak 1654

Avg Score

79.4

Avg Cost

Score/$

Runs

46

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
2563
refactoringexpert
2440
from-scratcheasy
2215
backendeasy
2046
multi-language
1912
from-scratchhard
1901
code-reviewmedium
1791
refactoring
1769
refactoringmedium
1739
full-stackhard
1738
full-stack
1698
backendexpert
1693
from-scratchmedium
1689
backendhard
1680
debuggingexpert
1676
debugginghard
1663
debugging
1647
code-review
1642
from-scratch
1641
backend
1639
frontendmedium
1633
full-stackmedium
1605
frontend
1575
backendmedium
1567
frontendhard
1485
backendmaster
1464
code-reviewhard
1453
frontendmaster
1242
from-scratchexpert
1052

All Results

TaskCategoryScore
Migrate Express monolith to modular architecturebackend70.3
Implement JWT auth middlewarebackend86.7
Build REST API from scratchfrom-scratch88.7
Write integration tests for payment flowcode-review70.3
Build CLI tool with subcommands and configfrom-scratch74.3
Build codebase indexer for LLM context windowsfrom-scratch77.7
Fix and extend Chrome browser extensionfrontend52.0
Add Redis caching layer to Express APIbackend83.4
Optimize slow Postgres queries in Flask appbackend84.2
Implement multi-tenant row-level security in Postgresbackend78.8
Build materialized view refresh pipeline for analyticsbackend77.3
Fix N+1 query in dashboardbackend74.7
Fix React hydration mismatchfrontend85.8
Split 1100-line god file into proper modulesrefactoring84.0
Remove AI slop and over-engineering from codebaserefactoring81.9
Fix auth bypass vulnerabilitydebugging91.0
Harden insecure Docker setup with 12 vulnerabilitiescode-review86.5
Write Kubernetes manifests for Node.js microservicefull-stack86.3
Build multi-tool LLM agent runtimebackend79.9
Fix hallucination and context window bugs in RAG agentbackend80.8
Fix 12 WCAG accessibility violations in checkout formfrontend77.7
Refactor monolithic handler to CQRSrefactoring82.1
Write tests for untested legacy Flask servicecode-review84.3
Fix race conditions in order matching enginebackend85.8
Build terminal UI dashboardfrom-scratch64.9
Build MCP server for database managementbackend81.5
Debug and fix 6 broken database triggers and constraintsdebugging86.8
Migrate callback-hell Express app to async/awaitrefactoring85.1
Implement background job scheduler with persistencebackend68.3
Convert React app to PWA with offline supportfrontend74.3
Implement zero-trust API authentication layerbackend73.5
Implement transformer inference engine with KV cachefrom-scratch70.9
Build RAG pipeline with vector searchbackend76.5
Write complex SQL report with window functionsbackend82.4
Fix Node.js stream backpressure causing OOM on large filesbackend70.8
Zero-downtime schema migrationfull-stack81.5
Find and fix 4 hidden backdoors in Flask appdebugging91.3
Fix data integrity bugs in denormalized e-commerce schemadebugging85.2
Add rate limiting middlewarebackend80.8
Replace console.log with structured loggingrefactoring77.2
Add Google OAuth2 login to Express appfull-stack77.7
Optimize bloated React bundle under 500KBfrontend75.5
Add virtual scrolling to table rendering 5000 rowsfrontend81.8
Port Python CLI to Rustmulti-language77.5
Add cursor-based pagination to REST APIbackend77.0
Add WebSocket real-time updatesfull-stack86.8