APEX

Qwen3.6 27b [BF16]

Qwen

262K context · <$0.01/M input · <$0.01/M output
1550 (peak 1566)

Avg Score: 75.2

Avg Cost

Score/$

Runs: 69

Win/Loss/Draw

Category ELOs

multi-language (expert): 2669
code-review (hard): 2468
refactoring (expert): 2440
frontend (expert): 2027
from-scratch (easy): 1752
code-review: 1678
backend (expert): 1658
backend (master): 1641
full-stack (hard): 1628
frontend (master): 1625
backend (hard): 1615
refactoring: 1613
backend: 1610
code-review (medium): 1603
multi-language: 1565
debugging (expert): 1565
backend (medium): 1552
refactoring (medium): 1535
full-stack: 1517
debugging: 1489
debugging (hard): 1470
frontend (medium): 1467
frontend: 1457
debugging (medium): 1454
from-scratch: 1400
frontend (easy): 1400
full-stack (medium): 1391
from-scratch (hard): 1340
multi-language (hard): 1104
from-scratch (expert): 1052
frontend (hard): 416

All Results

Task | Category | Score
Add i18n with locale routing to Next.js app | full-stack |
Build distributed node cluster with gossip protocol | from-scratch |
Add rate limiting middleware | backend |
Build terminal UI dashboard | from-scratch |
Fix Node.js stream backpressure causing OOM on large files | backend |
Build interactive data visualization dashboard | frontend |
Implement JWT auth middleware | backend |
Build 3D browser game with physics and multiplayer sync | frontend |
Add virtual scrolling to table rendering 5000 rows | frontend | 69.5
Add Google OAuth2 login to Express app | full-stack | 69.5
Build multi-tool LLM agent runtime | backend | 80.7
Build SaaS admin dashboard from scratch | from-scratch | 64.1
Add GraphQL layer over REST API | multi-language | 64.9
Implement background job scheduler with persistence | backend | 74.9
Replace console.log with structured logging | refactoring | 43.4
Migrate Express monolith to modular architecture | backend | 82.5
Build CLI tool with subcommands and config | from-scratch | 44.1
Fix and extend Chrome browser extension | frontend | 69.3
Fix and extend Chrome browser extension | frontend | 43.4
Add file upload with S3 presigned URLs | backend | 75.7
Build multi-tool LLM agent runtime | backend | 58.4
Add cursor-based pagination to REST API | backend | 67.0
Implement transformer inference engine with KV cache | from-scratch | 69.5
Write tests for untested legacy Flask service | code-review | 77.7
Implement Stripe webhook handler | backend | 76.5
Build MCP server for database management | backend | 64.8
Fix memory leak in event handler | debugging | 81.7
Fix deadlocking transaction patterns in Flask app | backend | 83.7
Add slash commands and moderation to Discord bot | backend | 78.0
Find and patch all OWASP Top 10 vulnerabilities | debugging | 66.8
Fix race conditions in order matching engine | backend | 82.5
Optimize slow Postgres queries in Flask app | backend | 79.0
Fix N+1 query in dashboard | backend | 82.8
Find and fix 4 hidden backdoors in Flask app | debugging | 84.3
Build materialized view refresh pipeline for analytics | backend | 83.7
Build RAG pipeline with vector search | backend | 75.2
Zero-downtime schema migration | full-stack | 82.4
Code review: identify security vulns | code-review | 77.8
Fix auth bypass vulnerability | debugging | 79.7
Convert React app to PWA with offline support | frontend | 69.0
Fix broken GitHub Actions CI pipeline | debugging | 73.7
Port Python CLI to Rust | multi-language | 79.8
Build LLM evaluation harness with structured grading | backend | 79.9
Refactor monolithic handler to CQRS | refactoring | 81.7
Implement multi-tenant row-level security in Postgres | backend | 77.3
Fix data integrity bugs in denormalized e-commerce schema | debugging | 81.5
Dockerize Node.js monorepo | full-stack | 69.3
Build production website with auth and members area | frontend | 73.7
Write Kubernetes manifests for Node.js microservice | full-stack | 77.2
Implement zero-trust API authentication layer | backend | 67.7
Fix hallucination and context window bugs in RAG agent | backend | 85.5
Build REST API from scratch | from-scratch | 82.0
Split 1100-line god file into proper modules | refactoring | 78.7
Optimize bloated React bundle under 500KB | frontend | 67.3
Write complex SQL report with window functions | backend | 77.2
Harden insecure Docker setup with 12 vulnerabilities | code-review | 82.5
Add caching layer to eliminate slow SSR page loads | full-stack | 83.3
Add WebSocket real-time updates | full-stack | 85.6
Fix 12 WCAG accessibility violations in checkout form | frontend | 60.1
Remove AI slop and over-engineering from codebase | refactoring | 87.5
Add retry logic and dead letter queue to Python task queue | backend | 78.0
Add Redis caching layer to Express API | backend | 79.1
Write integration tests for payment flow | code-review | 88.5
Migrate callback-hell Express app to async/await | refactoring | 78.0
Fix flaky test suite | debugging | 84.9
Debug and fix 6 broken database triggers and constraints | debugging | 87.8
Fix React hydration mismatch | frontend | 83.6
Fix broken responsive layout | frontend | 70.5
Debug race condition in worker pool | debugging | 85.2