APEX

Minimax M2.5 [Q4_K_XL]

LM Studio

197K context, <$0.01/M input, <$0.01/M output
ELO 1372 (peak 1373)

Avg Score: 63.9
Avg Cost: $0.03
Score/$: 2128.9
Runs: 39

Category ELOs

Category | Difficulty | ELO
from-scratch | easy | 2230
frontend | expert | 2135
multi-language | hard | 1926
frontend | hard | 1795
frontend | - | 1631
code-review | medium | 1631
backend | easy | 1627
multi-language | - | 1625
frontend | medium | 1568
debugging | hard | 1474
backend | hard | 1367
from-scratch | - | 1362
code-review | - | 1358
backend | medium | 1339
backend | - | 1326
debugging | - | 1295
refactoring | - | 1289
from-scratch | hard | 1205
full-stack | - | 1204
full-stack | hard | 1196
backend | expert | 1126
from-scratch | medium | 1120
from-scratch | expert | 985
debugging | expert | 974
refactoring | expert | 963
debugging | medium | 772
code-review | hard | 85

All Results

Task | Category | Score
Implement background job scheduler with persistence | backend | 37.0
Build MCP server for database management | backend | 47.0
Implement transformer inference engine with KV cache | from-scratch | 67.7
Build CLI tool with subcommands and config | from-scratch | 41.5
Build production website with auth and members area | frontend | 75.5
Build SaaS admin dashboard from scratch | from-scratch | 63.2
Fix data integrity bugs in denormalized e-commerce schema | debugging | 52.9
Build terminal UI dashboard | from-scratch | 55.6
Build real-time portfolio risk calculator | backend | 21.8
Write tests for untested legacy Flask service | code-review | 65.9
Add slash commands and moderation to Discord bot | backend | 57.1
Fix deadlocking transaction patterns in Flask app | backend | 68.5
Implement Stripe webhook handler | backend | 61.2
Build REST API from scratch | from-scratch | 90.1
Fix N+1 query in dashboard | backend | 64.5
Fix 12 WCAG accessibility violations in checkout form | frontend | 82.9
Add retry logic and dead letter queue to Python task queue | backend | 63.5
Fix auth bypass vulnerability | debugging | 93.7
Refactor monolithic handler to CQRS | refactoring | 51.8
Fix hallucination and context window bugs in RAG agent | backend | 67.7
Fix race conditions in order matching engine | backend | 66.5
Debug and fix 6 broken database triggers and constraints | debugging | 58.4
Add Redis caching layer to Express API | backend | 75.8
Fix flaky test suite | debugging | 44.8
Optimize slow Postgres queries in Flask app | backend | 81.7
Fix Node.js stream backpressure causing OOM on large files | backend | 81.8
Fix React hydration mismatch | frontend | 77.7
Build distributed node cluster with gossip protocol | from-scratch | 30.5
Find and fix 4 hidden backdoors in Flask app | debugging | 92.5
Debug race condition in worker pool | debugging | 68.7
Write integration tests for payment flow | code-review | 39.5
Add virtual scrolling to table rendering 5000 rows | frontend | 79.0
Build LLM evaluation harness with structured grading | backend | 41.6
Fix memory leak in event handler | debugging | 61.1
Write complex SQL report with window functions | backend | 78.5
Add rate limiting middleware | backend | 75.4
Add cursor-based pagination to REST API | backend | 67.6
Zero-downtime schema migration | full-stack | 62.4
Add GraphQL layer over REST API | multi-language | 80.5