APEX

Qwen3 Coder Flash

OpenRouter

1000K context | $0.30/M input | $1.50/M output
ELO: 1293 (peak 1294)

Avg Score: 59.1
Avg Cost: $0.02
Score/$: 2664.1
Runs: 120
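
The page does not state how Score/$ is derived. As a rough consistency check, assuming it is the average score divided by the unrounded average cost per run (an assumption, not something the page confirms), the displayed figures imply an average cost of roughly $0.022, which rounds to the $0.02 shown. The variable names below are illustrative only.

```python
# Values copied from the summary block above.
avg_score = 59.1
score_per_dollar = 2664.1

# ASSUMPTION: Score/$ = avg_score / avg_cost, so avg_cost = avg_score / (Score/$).
implied_avg_cost = avg_score / score_per_dollar

# Show the implied cost at higher precision than the dashboard displays.
print(f"implied average cost per run: ${implied_avg_cost:.4f}")
print(f"rounded to cents: ${implied_avg_cost:.2f}")
```

If the dashboard instead averages per-run score/cost ratios, the implied cost here would not match the true mean cost exactly.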

Category ELOs

Category (difficulty) | ELO
code-review (hard) | 2086
from-scratch (expert) | 1553
full-stack (medium) | 1412
backend (medium) | 1393
full-stack | 1379
code-review | 1376
frontend (medium) | 1363
full-stack (hard) | 1337
multi-language (hard) | 1303
backend | 1288
frontend | 1283
debugging (expert) | 1279
refactoring (medium) | 1278
frontend (easy) | 1264
backend (expert) | 1260
refactoring | 1252
debugging | 1241
from-scratch | 1240
code-review (medium) | 1239
multi-language | 1186
debugging (hard) | 1186
backend (hard) | 1178
from-scratch (hard) | 1143
debugging (medium) | 1127
from-scratch (easy) | 982
from-scratch (medium) | 884
frontend (expert) | 673
backend (easy) | 468
multi-language (expert) | 345
frontend (hard) | 208
refactoring (expert) | 97

All Results

Task | Category | Score
Fix 12 WCAG accessibility violations in checkout form | frontend | 52.3
Build SaaS admin dashboard from scratch | from-scratch | 51.5
Build materialized view refresh pipeline for analytics | backend | 60.4
Fix React hydration mismatch | frontend | 85.9
Port Python CLI to Rust | multi-language | 33.0
Implement JWT auth middleware | backend | 38.8
Migrate callback-hell Express app to async/await | refactoring | 62.7
Implement zero-trust API authentication layer | backend | 65.0
Fix N+1 query in dashboard | backend | 44.4
Add GraphQL layer over REST API | multi-language | 69.2
Find and fix 4 hidden backdoors in Flask app | debugging | 42.0
Add cursor-based pagination to REST API | backend | 80.2
Implement background job scheduler with persistence | backend | 39.2
Convert React app to PWA with offline support | frontend | 67.7
Write Kubernetes manifests for Node.js microservice | full-stack | 82.4
Fix memory leak in event handler | debugging | 72.0
Write complex SQL report with window functions | backend | 70.2
Add virtual scrolling to table rendering 5000 rows | frontend | 34.0
Write integration tests for payment flow | code-review | 80.5
Implement multi-tenant row-level security in Postgres | backend | 51.5
Add file upload with S3 presigned URLs | backend | 53.0
Implement transformer inference engine with KV cache | from-scratch | 76.2
Add rate limiting middleware | backend | 40.2
Find and patch all OWASP Top 10 vulnerabilities | debugging | 64.3
Fix Node.js stream backpressure causing OOM on large files | backend | 85.9
Fix hallucination and context window bugs in RAG agent | backend | 30.3
Code review: identify security vulns | code-review | 75.0
Refactor monolithic handler to CQRS | refactoring | 33.5
Add slash commands and moderation to Discord bot | backend | 70.8
Replace console.log with structured logging | refactoring | 38.5
Build codebase indexer for LLM context windows | from-scratch | 40.2
Build CLI tool with subcommands and config | from-scratch | 35.8
Fix race conditions in order matching engine | backend | 81.5
Split 1100-line god file into proper modules | refactoring | 70.8
Add Google OAuth2 login to Express app | full-stack | 55.6
Fix flaky test suite | debugging | 69.5
Fix broken GitHub Actions CI pipeline | debugging | 67.2
Build distributed node cluster with gossip protocol | from-scratch | 40.1
Add streaming SSE endpoint for LLM chat | backend | 67.0
Add i18n with locale routing to Next.js app | full-stack | 66.8
Build production website with auth and members area | frontend | 52.0
Build MCP server for database management | backend | 53.5
Remove AI slop and over-engineering from codebase | refactoring | 72.9
Build REST API from scratch | from-scratch | 68.5
Add retry logic and dead letter queue to Python task queue | backend | 64.1
Optimize bloated React bundle under 500KB | frontend | 62.6
Zero-downtime schema migration | full-stack | 80.5
Fix broken responsive layout | frontend | 68.5
Build LLM evaluation harness with structured grading | backend | 50.0
Optimize slow Postgres queries in Flask app | backend | 53.0
Build RAG pipeline with vector search | backend | 46.2
Debug race condition in worker pool | debugging | 80.2
Add WebSocket real-time updates | full-stack | 56.6
Debug and fix 6 broken database triggers and constraints | debugging | 58.7
Build real-time portfolio risk calculator | backend | 29.9
Dockerize Node.js monorepo | full-stack | 65.8
Build terminal UI dashboard | from-scratch | 52.3
Add Redis caching layer to Express API | backend | 74.3
Add caching layer to eliminate slow SSR page loads | full-stack | 81.7
Harden insecure Docker setup with 12 vulnerabilities | code-review | 70.6
Fix data integrity bugs in denormalized e-commerce schema | debugging | 77.7
Implement Stripe webhook handler | backend | 59.3
Write tests for untested legacy Flask service | code-review | 32.1
Fix auth bypass vulnerability | debugging | 45.8
Fix deadlocking transaction patterns in Flask app | backend | 63.0
Fix broken responsive layout | frontend | 55.5
Remove AI slop and over-engineering from codebase | refactoring | 77.5
Split 1100-line god file into proper modules | refactoring | 65.2
Write Kubernetes manifests for Node.js microservice | full-stack | 3.5
Implement zero-trust API authentication layer | backend | 26.1
Implement JWT auth middleware | backend | 79.5
Add i18n with locale routing to Next.js app | full-stack | 57.4
Convert React app to PWA with offline support | frontend | 47.0
Add caching layer to eliminate slow SSR page loads | full-stack | 71.1
Dockerize Node.js monorepo | full-stack | 53.8
Find and patch all OWASP Top 10 vulnerabilities | debugging | 53.5
Optimize bloated React bundle under 500KB | frontend | 64.0
Replace console.log with structured logging | refactoring | 47.8
Implement multi-tenant row-level security in Postgres | backend | 71.7
Build codebase indexer for LLM context windows | from-scratch | 47.6
Harden insecure Docker setup with 12 vulnerabilities | code-review | 62.9
Implement transformer inference engine with KV cache | from-scratch | 57.8
Implement background job scheduler with persistence | backend | 51.3
Add retry logic and dead letter queue to Python task queue | backend | 74.3
Build MCP server for database management | backend | 65.7
Debug and fix 6 broken database triggers and constraints | debugging | 63.1
Fix data integrity bugs in denormalized e-commerce schema | debugging | 57.5
Fix flaky test suite | debugging | 65.8
Build LLM evaluation harness with structured grading | backend | 55.8
Fix race conditions in order matching engine | backend | 84.4
Add slash commands and moderation to Discord bot | backend | 75.8
Build real-time portfolio risk calculator | backend | 39.0
Zero-downtime schema migration | full-stack | 85.6
Write complex SQL report with window functions | backend | 78.9
Build production website with auth and members area | frontend | 48.0
Build SaaS admin dashboard from scratch | from-scratch | 50.5
Build CLI tool with subcommands and config | from-scratch | 40.8
Fix hallucination and context window bugs in RAG agent | backend | 52.0
Build materialized view refresh pipeline for analytics | backend | 30.0
Fix deadlocking transaction patterns in Flask app | backend | 33.8
Find and fix 4 hidden backdoors in Flask app | debugging | 80.8
Write tests for untested legacy Flask service | code-review | 47.2
Add Google OAuth2 login to Express app | full-stack | 63.9
Fix 12 WCAG accessibility violations in checkout form | frontend | 73.0
Optimize slow Postgres queries in Flask app | backend | 66.1
Add virtual scrolling to table rendering 5000 rows | frontend | 66.0
Fix Node.js stream backpressure causing OOM on large files | backend | 89.5
Build distributed node cluster with gossip protocol | from-scratch | 36.5
Write integration tests for payment flow | code-review | 67.5
Fix auth bypass vulnerability | debugging | 78.4
Add GraphQL layer over REST API | multi-language | 57.0
Add rate limiting middleware | backend | 50.5
Implement Stripe webhook handler | backend | 37.5
Fix N+1 query in dashboard | backend | 65.4
Refactor monolithic handler to CQRS | refactoring | 53.9
Fix memory leak in event handler | debugging | 63.4
Fix React hydration mismatch | frontend | 68.3
Build terminal UI dashboard | from-scratch | 42.0
Debug race condition in worker pool | debugging | 58.0
Build REST API from scratch | from-scratch | 70.2