APEX
Back to models

Deepseek V3.2

OpenRouter

164K context$0.25/M input$0.38/M output
1387peak 1388

Avg Score

64.0

Avg Cost

$0.04

Score/$

1463.2

Runs

80

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
1638
code-reviewmedium
1564
code-review
1497
debuggingmedium
1488
debugginghard
1482
frontendhard
1473
frontendeasy
1472
frontendexpert
1467
frontend
1431
frontendmedium
1419
debugging
1418
backendhard
1413
backendexpert
1406
backend
1394
backendmedium
1375
debuggingexpert
1361
from-scratcheasy
1338
refactoringmedium
1334
refactoring
1317
from-scratch
1312
full-stack
1284
multi-language
1281
code-reviewhard
1269
full-stackhard
1268
full-stackmedium
1267
from-scratchmedium
1252
from-scratchhard
1244
from-scratchexpert
1008
backendeasy
636
refactoringexpert
417
multi-languagehard
0

All Results

TaskCategoryScore
Build distributed node cluster with gossip protocolfrom-scratch38.7
Write complex SQL report with window functionsbackend70.6
Find and fix 4 hidden backdoors in Flask appdebugging72.5
Convert React app to PWA with offline supportfrontend63.6
Debug and fix 6 broken database triggers and constraintsdebugging72.5
Add retry logic and dead letter queue to Python task queuebackend38.0
Implement zero-trust API authentication layerbackend76.8
Implement multi-tenant row-level security in Postgresbackend60.0
Implement background job scheduler with persistencebackend62.0
Add file upload with S3 presigned URLsbackend40.0
Optimize slow Postgres queries in Flask appbackend63.1
Add i18n with locale routing to Next.js appfull-stack65.8
Find and patch all OWASP Top 10 vulnerabilitiesdebugging67.9
Build real-time portfolio risk calculatorbackend52.9
Fix memory leak in event handlerdebugging39.3
Remove AI slop and over-engineering from codebaserefactoring79.1
Fix broken GitHub Actions CI pipelinedebugging90.8
Add Redis caching layer to Express APIbackend66.2
Add Google OAuth2 login to Express appfull-stack8.6
Add GraphQL layer over REST APImulti-language33.3
Add streaming SSE endpoint for LLM chatbackend84.0
Fix auth bypass vulnerabilitydebugging95.0
Migrate callback-hell Express app to async/awaitrefactoring58.7
Port Python CLI to Rustmulti-language54.3
Build materialized view refresh pipeline for analyticsbackend62.9
Build RAG pipeline with vector searchbackend51.3
Code review: identify security vulnscode-review83.8
Optimize slow Postgres queries in Flask appbackend84.8
Add WebSocket real-time updatesfull-stack75.3
Implement zero-trust API authentication layerbackend68.3
Optimize bloated React bundle under 500KBfrontend68.5
Fix broken responsive layoutfrontend72.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging67.5
Convert React app to PWA with offline supportfrontend65.0
Implement JWT auth middlewarebackend69.8
Split 1100-line god file into proper modulesrefactoring75.9
Dockerize Node.js monorepofull-stack67.3
Write Kubernetes manifests for Node.js microservicefull-stack71.5
Add caching layer to eliminate slow SSR page loadsfull-stack78.5
Remove AI slop and over-engineering from codebaserefactoring85.7
Build codebase indexer for LLM context windowsfrom-scratch27.0
Implement multi-tenant row-level security in Postgresbackend72.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review79.5
Replace console.log with structured loggingrefactoring40.8
Add i18n with locale routing to Next.js appfull-stack55.5
Add rate limiting middlewarebackend43.1
Build production website with auth and members areafrontend65.7
Build SaaS admin dashboard from scratchfrom-scratch65.0
Fix hallucination and context window bugs in RAG agentbackend53.6
Build LLM evaluation harness with structured gradingbackend64.2
Implement background job scheduler with persistencebackend49.1
Build MCP server for database managementbackend81.8
Build CLI tool with subcommands and configfrom-scratch53.8
Implement transformer inference engine with KV cachefrom-scratch68.9
Build real-time portfolio risk calculatorbackend46.1
Fix race conditions in order matching enginebackend81.5
Fix data integrity bugs in denormalized e-commerce schemadebugging76.9
Write tests for untested legacy Flask servicecode-review56.9
Fix deadlocking transaction patterns in Flask appbackend47.0
Write complex SQL report with window functionsbackend73.0
Debug and fix 6 broken database triggers and constraintsdebugging57.8
Find and fix 4 hidden backdoors in Flask appdebugging71.0
Write integration tests for payment flowcode-review68.1
Fix 12 WCAG accessibility violations in checkout formfrontend77.3
Add retry logic and dead letter queue to Python task queuebackend63.5
Add slash commands and moderation to Discord botbackend63.5
Fix Node.js stream backpressure causing OOM on large filesbackend80.8
Add virtual scrolling to table rendering 5000 rowsfrontend72.6
Build distributed node cluster with gossip protocolfrom-scratch47.3
Add cursor-based pagination to REST APIbackend85.9
Build terminal UI dashboardfrom-scratch59.7
Zero-downtime schema migrationfull-stack63.1
Refactor monolithic handler to CQRSrefactoring40.0
Implement Stripe webhook handlerbackend53.5
Fix flaky test suitedebugging58.0
Build REST API from scratchfrom-scratch76.3
Fix React hydration mismatchfrontend80.3
Fix N+1 query in dashboardbackend53.9
Fix memory leak in event handlerdebugging54.5
Debug race condition in worker pooldebugging90.5