APEX
Back to models

Minimax M2.5

OpenRouter

205K context$0.30/M input$1.20/M output
1381peak 1382

Avg Score

65.5

Avg Cost

$0.14

Score/$

473.3

Runs

123

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratchexpert
2041
frontendhard
1713
debugginghard
1608
code-reviewhard
1599
frontendexpert
1532
frontend
1474
backendexpert
1456
frontendmedium
1453
full-stack
1450
full-stackhard
1445
full-stackmedium
1443
debugging
1417
backendmedium
1413
frontendeasy
1412
refactoringmedium
1369
code-review
1368
refactoring
1349
backend
1341
from-scratch
1340
code-reviewmedium
1320
from-scratcheasy
1314
debuggingexpert
1313
from-scratchhard
1262
multi-language
1243
backendhard
1189
debuggingmedium
1169
multi-languagehard
1035
multi-languageexpert
817
backendeasy
636
refactoringexpert
477
from-scratchmedium
0

All Results

TaskCategoryScore
Port Python CLI to Rustmulti-language39.6
Build SaaS admin dashboard from scratchfrom-scratch53.0
Implement zero-trust API authentication layerbackend81.7
Build real-time portfolio risk calculatorbackend79.0
Build codebase indexer for LLM context windowsfrom-scratch70.1
Code review: identify security vulnscode-review81.3
Split 1100-line god file into proper modulesrefactoring71.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging65.8
Fix broken responsive layoutfrontend71.1
Write complex SQL report with window functionsbackend60.5
Optimize bloated React bundle under 500KBfrontend73.2
Implement background job scheduler with persistencebackend31.9
Build materialized view refresh pipeline for analyticsbackend74.5
Optimize slow Postgres queries in Flask appbackend69.7
Fix 12 WCAG accessibility violations in checkout formfrontend81.6
Implement multi-tenant row-level security in Postgresbackend44.8
Fix race conditions in order matching enginebackend59.6
Debug race condition in worker pooldebugging85.1
Build MCP server for database managementbackend53.5
Fix Node.js stream backpressure causing OOM on large filesbackend32.8
Add cursor-based pagination to REST APIbackend72.0
Fix auth bypass vulnerabilitydebugging89.7
Write integration tests for payment flowcode-review73.3
Fix flaky test suitedebugging64.5
Add slash commands and moderation to Discord botbackend75.9
Add WebSocket real-time updatesfull-stack73.3
Replace console.log with structured loggingrefactoring54.6
Fix hallucination and context window bugs in RAG agentbackend45.6
Add GraphQL layer over REST APImulti-language63.8
Build REST API from scratchfrom-scratch74.6
Add Redis caching layer to Express APIbackend79.8
Remove AI slop and over-engineering from codebaserefactoring75.1
Add Google OAuth2 login to Express appfull-stack81.2
Convert React app to PWA with offline supportfrontend80.2
Fix data integrity bugs in denormalized e-commerce schemadebugging74.5
Build terminal UI dashboardfrom-scratch31.6
Fix N+1 query in dashboardbackend45.0
Implement Stripe webhook handlerbackend53.3
Fix broken GitHub Actions CI pipelinedebugging75.7
Write tests for untested legacy Flask servicecode-review38.1
Add virtual scrolling to table rendering 5000 rowsfrontend40.3
Fix React hydration mismatchfrontend78.0
Build CLI tool with subcommands and configfrom-scratch29.4
Find and fix 4 hidden backdoors in Flask appdebugging90.3
Dockerize Node.js monorepofull-stack71.5
Implement JWT auth middlewarebackend53.9
Implement transformer inference engine with KV cachefrom-scratch85.4
Build RAG pipeline with vector searchbackend42.0
Build LLM evaluation harness with structured gradingbackend43.5
Add i18n with locale routing to Next.js appfull-stack68.3
Write Kubernetes manifests for Node.js microservicefull-stack82.2
Fix memory leak in event handlerdebugging81.0
Add streaming SSE endpoint for LLM chatbackend77.3
Fix deadlocking transaction patterns in Flask appbackend78.0
Add rate limiting middlewarebackend43.0
Debug and fix 6 broken database triggers and constraintsdebugging72.2
Harden insecure Docker setup with 12 vulnerabilitiescode-review68.0
Build production website with auth and members areafrontend67.3
Refactor monolithic handler to CQRSrefactoring40.6
Add file upload with S3 presigned URLsbackend77.3
Zero-downtime schema migrationfull-stack63.0
Add caching layer to eliminate slow SSR page loadsfull-stack80.1
Add retry logic and dead letter queue to Python task queuebackend74.8
Migrate callback-hell Express app to async/awaitrefactoring62.8
Build distributed node cluster with gossip protocolfrom-scratch36.6
Build materialized view refresh pipeline for analyticsbackend72.6
Fix broken responsive layoutfrontend59.3
Harden insecure Docker setup with 12 vulnerabilitiescode-review79.0
Add caching layer to eliminate slow SSR page loadsfull-stack81.9
Replace console.log with structured loggingrefactoring40.0
Implement multi-tenant row-level security in Postgresbackend70.8
Implement JWT auth middlewarebackend70.3
Add i18n with locale routing to Next.js appfull-stack69.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging74.0
Build codebase indexer for LLM context windowsfrom-scratch33.0
Add file upload with S3 presigned URLsbackend66.0
Split 1100-line god file into proper modulesrefactoring71.8
Write Kubernetes manifests for Node.js microservicefull-stack86.1
Dockerize Node.js monorepofull-stack77.3
Remove AI slop and over-engineering from codebaserefactoring78.0
Optimize bloated React bundle under 500KBfrontend75.3
Convert React app to PWA with offline supportfrontend64.9
Implement zero-trust API authentication layerbackend67.7
Build CLI tool with subcommands and configfrom-scratch37.8
Build production website with auth and members areafrontend69.6
Build MCP server for database managementbackend50.1
Implement transformer inference engine with KV cachefrom-scratch87.7
Build SaaS admin dashboard from scratchfrom-scratch66.3
Implement background job scheduler with persistencebackend62.6
Build real-time portfolio risk calculatorbackend46.4
Fix hallucination and context window bugs in RAG agentbackend69.8
Build LLM evaluation harness with structured gradingbackend48.5
Fix race conditions in order matching enginebackend70.0
Fix data integrity bugs in denormalized e-commerce schemadebugging62.9
Write complex SQL report with window functionsbackend48.8
Fix deadlocking transaction patterns in Flask appbackend58.8
Debug and fix 6 broken database triggers and constraintsdebugging57.0
Find and fix 4 hidden backdoors in Flask appdebugging55.8
Write tests for untested legacy Flask servicecode-review36.6
Add Google OAuth2 login to Express appfull-stack69.2
Optimize slow Postgres queries in Flask appbackend66.8
Add slash commands and moderation to Discord botbackend67.2
Add retry logic and dead letter queue to Python task queuebackend57.6
Fix Node.js stream backpressure causing OOM on large filesbackend71.2
Fix 12 WCAG accessibility violations in checkout formfrontend80.8
Add virtual scrolling to table rendering 5000 rowsfrontend65.8
Build distributed node cluster with gossip protocolfrom-scratch36.8
Write integration tests for payment flowcode-review55.8
Add GraphQL layer over REST APImulti-language81.3
Fix auth bypass vulnerabilitydebugging92.6
Add rate limiting middlewarebackend69.8
Zero-downtime schema migrationfull-stack69.0
Implement Stripe webhook handlerbackend67.5
Fix flaky test suitedebugging80.9
Add cursor-based pagination to REST APIbackend63.6
Fix N+1 query in dashboardbackend78.2
Build terminal UI dashboardfrom-scratch61.5
Fix memory leak in event handlerdebugging62.8
Code review: identify security vulnscode-review78.3
Refactor monolithic handler to CQRSrefactoring64.7
Debug race condition in worker pooldebugging80.8
Fix React hydration mismatchfrontend74.8
Build REST API from scratchfrom-scratch85.6