APEX
Back to models

Qwen3.6 35b A3b [BF16]

Qwen

262K context<$0.01/M input<$0.01/M output
1557peak 1558

Avg Score

74.7

Avg Cost

Score/$

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
2563
multi-languagehard
2293
refactoringexpert
2022
multi-language
1917
from-scratchmedium
1867
debuggingexpert
1732
frontendhard
1724
from-scratchhard
1692
code-reviewhard
1663
backendexpert
1629
frontendmedium
1616
refactoring
1608
full-stackhard
1606
frontendeasy
1605
frontendexpert
1600
frontend
1595
backendhard
1591
refactoringmedium
1578
full-stack
1571
frontendmaster
1566
debugging
1564
debuggingmedium
1563
from-scratch
1536
backend
1535
full-stackmedium
1521
debugginghard
1508
code-review
1475
backendmedium
1471
code-reviewmedium
1465
from-scratcheasy
1244
backendeasy
1203
backendmaster
1159
from-scratchexpert
1008

All Results

TaskCategoryScore
Build production website with auth and members areafrontend68.2
Implement transformer inference engine with KV cachefrom-scratch69.0
Build interactive data visualization dashboardfrontend71.5
Optimize slow Postgres queries in Flask appbackend86.8
Implement JWT auth middlewarebackend64.3
Fix and extend Chrome browser extensionfrontend65.6
Add i18n with locale routing to Next.js appfull-stack72.5
Build LLM evaluation harness with structured gradingbackend79.9
Add caching layer to eliminate slow SSR page loadsfull-stack86.5
Write tests for untested legacy Flask servicecode-review39.5
Implement zero-trust API authentication layerbackend71.3
Fix memory leak in event handlerdebugging55.3
Add Redis caching layer to Express APIbackend82.6
Convert React app to PWA with offline supportfrontend71.9
Fix broken responsive layoutfrontend73.8
Fix data integrity bugs in denormalized e-commerce schemadebugging85.6
Fix flaky test suitedebugging91.8
Add WebSocket real-time updatesfull-stack79.7
Find and fix 4 hidden backdoors in Flask appdebugging89.7
Build multi-tool LLM agent runtimebackend77.8
Write Kubernetes manifests for Node.js microservicefull-stack80.8
Build distributed node cluster with gossip protocolfrom-scratch70.3
Implement background job scheduler with persistencebackend67.5
Build codebase indexer for LLM context windowsfrom-scratch75.6
Implement Stripe webhook handlerbackend79.2
Fix race conditions in order matching enginebackend83.0
Zero-downtime schema migrationfull-stack77.3
Build real-time portfolio risk calculatorbackend59.4
Remove AI slop and over-engineering from codebaserefactoring81.4
Write complex SQL report with window functionsbackend74.6
Find and patch all OWASP Top 10 vulnerabilitiesdebugging80.9
Debug race condition in worker pooldebugging84.3
Build 3D browser game with physics and multiplayer syncfrontend79.5
Build materialized view refresh pipeline for analyticsbackend73.8
Add file upload with S3 presigned URLsbackend63.6
Build SaaS admin dashboard from scratchfrom-scratch65.3
Build RAG pipeline with vector searchbackend67.8
Fix broken GitHub Actions CI pipelinedebugging72.2
Fix auth bypass vulnerabilitydebugging89.7
Add Google OAuth2 login to Express appfull-stack78.2
Fix Node.js stream backpressure causing OOM on large filesbackend80.9
Optimize bloated React bundle under 500KBfrontend83.2
Split 1100-line god file into proper modulesrefactoring68.7
Migrate Express monolith to modular architecturebackend30.5
Build terminal UI dashboardfrom-scratch67.7
Fix N+1 query in dashboardbackend81.2
Build CLI tool with subcommands and configfrom-scratch68.3
Add GraphQL layer over REST APImulti-language87.3
Implement multi-tenant row-level security in Postgresbackend71.5
Port Python CLI to Rustmulti-language77.0
Fix deadlocking transaction patterns in Flask appbackend85.2
Add rate limiting middlewarebackend66.8
Dockerize Node.js monorepofull-stack70.8
Fix React hydration mismatchfrontend79.5
Debug and fix 6 broken database triggers and constraintsdebugging86.9
Migrate callback-hell Express app to async/awaitrefactoring84.2
Add cursor-based pagination to REST APIbackend74.9
Add slash commands and moderation to Discord botbackend67.9
Fix hallucination and context window bugs in RAG agentbackend80.5
Add streaming SSE endpoint for LLM chatbackend53.8
Fix 12 WCAG accessibility violations in checkout formfrontend82.3
Add retry logic and dead letter queue to Python task queuebackend83.5
Add virtual scrolling to table rendering 5000 rowsfrontend80.8
Code review: identify security vulnscode-review78.6
Build REST API from scratchfrom-scratch73.5
Write integration tests for payment flowcode-review75.2
Refactor monolithic handler to CQRSrefactoring75.0
Replace console.log with structured loggingrefactoring65.8
Harden insecure Docker setup with 12 vulnerabilitiescode-review84.8
Build MCP server for database managementbackend76.9