APEX

Qwen3.6 Plus

OpenRouter

1000K context · $0.33/M input · $1.95/M output
1636 (peak 1638)

Avg Score: 77.8
Avg Cost: $0.41
Score/$: 191.3
Runs: 89

Category ELOs

Category        Difficulty  ELO
refactoring     expert      2641
frontend        expert      2395
multi-language  hard        2152
backend         easy        2097
from-scratch    medium      1938
from-scratch    hard        1913
frontend        hard        1847
debugging       medium      1834
code-review     medium      1815
backend         expert      1803
backend         hard        1715
code-review     -           1700
from-scratch    -           1692
backend         -           1678
from-scratch    expert      1625
debugging       -           1620
debugging       expert      1616
backend         medium      1615
full-stack      medium      1603
debugging       hard        1584
refactoring     -           1582
multi-language  -           1581
frontend        -           1578
full-stack      -           1563
frontend        medium      1534
code-review     hard        1534
full-stack      hard        1532
refactoring     medium      1475
frontend        master      1475
from-scratch    easy        1461
frontend        easy        1400
backend         master      1247
multi-language  expert      1121

All Results

Task                                                        Category        Score
Add file upload with S3 presigned URLs                      backend         62.1
Implement JWT auth middleware                               backend         79.9
Migrate Express monolith to modular architecture            backend         33.1
Build interactive data visualization dashboard              frontend        49.9
Optimize bloated React bundle under 500KB                   frontend        81.0
Build 3D browser game with physics and multiplayer sync     frontend        74.5
Add streaming SSE endpoint for LLM chat                     backend         72.8
Fix memory leak in event handler                            debugging       80.0
Build multi-tool LLM agent runtime                          backend         81.0
Fix and extend Chrome browser extension                     frontend        73.0
Convert React app to PWA with offline support               frontend        78.8
Build MCP server for database management                    backend         82.1
Replace console.log with structured logging                 refactoring     43.7
Build SaaS admin dashboard from scratch                     from-scratch    72.9
Harden insecure Docker setup with 12 vulnerabilities        code-review     89.3
Migrate callback-hell Express app to async/await            refactoring     87.5
Build RAG pipeline with vector search                       backend         70.4
Fix hallucination and context window bugs in RAG agent      backend         81.2
Fix Node.js stream backpressure causing OOM on large files  backend         88.7
Build distributed node cluster with gossip protocol         from-scratch    79.5
Build LLM evaluation harness with structured grading        backend         83.8
Write complex SQL report with window functions              backend         88.3
Build terminal UI dashboard                                 from-scratch    69.4
Implement zero-trust API authentication layer               backend         76.1
Build CLI tool with subcommands and config                  from-scratch    78.8
Add cursor-based pagination to REST API                     backend         82.5
Build materialized view refresh pipeline for analytics      backend         76.7
Implement multi-tenant row-level security in Postgres       backend         82.8
Implement transformer inference engine with KV cache        from-scratch    78.0
Add Redis caching layer to Express API                      backend         81.6
Implement background job scheduler with persistence         backend         71.2
Fix broken responsive layout                                frontend        70.5
Find and patch all OWASP Top 10 vulnerabilities             debugging       71.9
Write Kubernetes manifests for Node.js microservice         full-stack      83.4
Fix broken GitHub Actions CI pipeline                       debugging       90.9
Fix race conditions in order matching engine                backend         84.4
Add caching layer to eliminate slow SSR page loads          full-stack      83.7
Fix deadlocking transaction patterns in Flask app           backend         86.8
Debug race condition in worker pool                         debugging       87.8
Build production website with auth and members area         frontend        80.8
Fix 12 WCAG accessibility violations in checkout form       frontend        84.4
Split 1100-line god file into proper modules                refactoring     76.8
Write integration tests for payment flow                    code-review     71.8
Add retry logic and dead letter queue to Python task queue  backend         84.5
Add slash commands and moderation to Discord bot            backend         81.7
Remove AI slop and over-engineering from codebase           refactoring     76.7
Debug and fix 6 broken database triggers and constraints    debugging       87.0
Build real-time portfolio risk calculator                   backend         79.5
Build codebase indexer for LLM context windows              from-scratch    72.5
Add Google OAuth2 login to Express app                      full-stack      61.4
Write tests for untested legacy Flask service               code-review     87.3
Add GraphQL layer over REST API                             multi-language  84.8
Build REST API from scratch                                 from-scratch    78.7
Add virtual scrolling to table rendering 5000 rows          frontend        74.0
Optimize slow Postgres queries in Flask app                 backend         87.3
Port Python CLI to Rust                                     multi-language  45.2
Fix data integrity bugs in denormalized e-commerce schema   debugging       81.6
Add i18n with locale routing to Next.js app                 full-stack      77.0
Refactor monolithic handler to CQRS                         refactoring     85.8
Code review: identify security vulns                        code-review     81.1
Add WebSocket real-time updates                             full-stack      84.4
Fix N+1 query in dashboard                                  backend         79.0
Fix auth bypass vulnerability                               debugging       85.0
Fix flaky test suite                                        debugging       91.7
Find and fix 4 hidden backdoors in Flask app                debugging       89.1
Add rate limiting middleware                                backend         82.3
Zero-downtime schema migration                              full-stack      72.8
Dockerize Node.js monorepo                                  full-stack      78.7
Fix React hydration mismatch                                frontend        70.2
Implement Stripe webhook handler                            backend         86.0
Implement background job scheduler with persistence         backend
Port Python CLI to Rust                                     multi-language
Fix hallucination and context window bugs in RAG agent      backend
Write integration tests for payment flow                    code-review
Add GraphQL layer over REST API                             multi-language
Fix React hydration mismatch                                frontend
Write tests for untested legacy Flask service               code-review
Build terminal UI dashboard                                 from-scratch
Optimize bloated React bundle under 500KB                   frontend
Add rate limiting middleware                                backend
Implement multi-tenant row-level security in Postgres       backend
Code review: identify security vulns                        code-review
Add virtual scrolling to table rendering 5000 rows          frontend
Add streaming SSE endpoint for LLM chat                     backend
Zero-downtime schema migration                              full-stack
Fix memory leak in event handler                            debugging
Fix data integrity bugs in denormalized e-commerce schema   debugging
Implement Stripe webhook handler                            backend
Dockerize Node.js monorepo                                  full-stack