APEX
Back to models

Qwen3.5 35b A3b [Q4_K_XL]

LM Studio

262K context<$0.01/M input<$0.01/M output
1246peak 1248

Avg Score

49.4

Avg Cost

$0.05

Score/$

928.7

Runs

105

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratchmedium
1710
full-stackhard
1449
from-scratchhard
1430
debugginghard
1419
backendeasy
1413
debuggingexpert
1400
from-scratch
1329
debugging
1325
multi-languagehard
1303
multi-language
1299
backendmedium
1287
backendhard
1284
full-stack
1280
backend
1242
frontendmedium
1230
frontend
1149
frontendexpert
1020
code-review
973
backendexpert
954
code-reviewmedium
924
multi-languageexpert
885
refactoring
880
refactoringmedium
856
debuggingmedium
598
full-stackmedium
310
code-reviewhard
193
from-scratcheasy
0
refactoringexpert
0
from-scratchexpert
0
frontendhard
0
frontendeasy
0

All Results

TaskCategoryScore
Write tests for untested legacy Flask servicecode-review36.3
Add GraphQL layer over REST APImulti-language
Build MCP server for database managementbackend66.5
Implement Stripe webhook handlerbackend
Write Kubernetes manifests for Node.js microservicefull-stack28.0
Implement multi-tenant row-level security in Postgresbackend22.0
Harden insecure Docker setup with 12 vulnerabilitiescode-review28.0
Remove AI slop and over-engineering from codebaserefactoring28.0
Add file upload with S3 presigned URLsbackend73.2
Fix hallucination and context window bugs in RAG agentbackend66.3
Split 1100-line god file into proper modulesrefactoring28.0
Debug race condition in worker pooldebugging28.0
Add caching layer to eliminate slow SSR page loadsfull-stack28.0
Build codebase indexer for LLM context windowsfrom-scratch49.5
Convert React app to PWA with offline supportfrontend28.0
Fix memory leak in event handlerdebugging
Fix deadlocking transaction patterns in Flask appbackend22.0
Build terminal UI dashboardfrom-scratch65.5
Fix Node.js stream backpressure causing OOM on large filesbackend
Build real-time portfolio risk calculatorbackend52.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging28.0
Add retry logic and dead letter queue to Python task queuebackend28.0
Add Google OAuth2 login to Express appfull-stack68.4
Optimize slow Postgres queries in Flask appbackend28.0
Migrate callback-hell Express app to async/awaitrefactoring28.0
Add Redis caching layer to Express APIbackend
Build SaaS admin dashboard from scratchfrom-scratch56.5
Implement transformer inference engine with KV cachefrom-scratch28.0
Debug and fix 6 broken database triggers and constraintsdebugging82.0
Implement background job scheduler with persistencebackend31.3
Code review: identify security vulnscode-review28.0
Build distributed node cluster with gossip protocolfrom-scratch45.8
Fix flaky test suitedebugging28.0
Build REST API from scratchfrom-scratch22.0
Fix 12 WCAG accessibility violations in checkout formfrontend28.0
Build LLM evaluation harness with structured gradingbackend28.0
Fix N+1 query in dashboardbackend50.0
Optimize bloated React bundle under 500KBfrontend28.0
Fix race conditions in order matching enginebackend22.0
Implement zero-trust API authentication layerbackend44.9
Dockerize Node.js monorepofull-stack28.0
Fix broken responsive layoutfrontend22.0
Add i18n with locale routing to Next.js appfull-stack28.0
Build production website with auth and members areafrontend58.6
Fix data integrity bugs in denormalized e-commerce schemadebugging81.7
Fix auth bypass vulnerabilitydebugging
Refactor monolithic handler to CQRSrefactoring46.2
Fix broken GitHub Actions CI pipelinedebugging55.9
Write complex SQL report with window functionsbackend61.9
Implement JWT auth middlewarebackend78.5
Zero-downtime schema migrationfull-stack90.0
Build CLI tool with subcommands and configfrom-scratch67.5
Port Python CLI to Rustmulti-language41.5
Add slash commands and moderation to Discord botbackend68.0
Add streaming SSE endpoint for LLM chatbackend66.4
Add rate limiting middlewarebackend69.4
Add WebSocket real-time updatesfull-stack73.8
Write integration tests for payment flowcode-review46.0
Replace console.log with structured loggingrefactoring42.3
Add virtual scrolling to table rendering 5000 rowsfrontend42.5
Find and fix 4 hidden backdoors in Flask appdebugging87.4
Fix React hydration mismatchfrontend50.0
Build RAG pipeline with vector searchbackend44.5
Build materialized view refresh pipeline for analyticsbackend83.1
Add cursor-based pagination to REST APIbackend42.0
Build MCP server for database managementbackend54.0
Implement transformer inference engine with KV cachefrom-scratch58.7
Build SaaS admin dashboard from scratchfrom-scratch51.8
Implement background job scheduler with persistencebackend28.7
Build production website with auth and members areafrontend53.5
Build CLI tool with subcommands and configfrom-scratch3.3
Fix hallucination and context window bugs in RAG agentbackend45.0
Build LLM evaluation harness with structured gradingbackend37.7
Build real-time portfolio risk calculatorbackend47.0
Fix race conditions in order matching enginebackend81.9
Fix data integrity bugs in denormalized e-commerce schemadebugging54.9
Build materialized view refresh pipeline for analyticsbackend47.9
Fix deadlocking transaction patterns in Flask appbackend46.3
Debug and fix 6 broken database triggers and constraintsdebugging52.1
Write complex SQL report with window functionsbackend44.0
Find and fix 4 hidden backdoors in Flask appdebugging89.0
Add Redis caching layer to Express APIbackend69.2
Write tests for untested legacy Flask servicecode-review50.1
Optimize slow Postgres queries in Flask appbackend74.4
Add slash commands and moderation to Discord botbackend36.6
Add retry logic and dead letter queue to Python task queuebackend66.8
Fix Node.js stream backpressure causing OOM on large filesbackend73.8
Add virtual scrolling to table rendering 5000 rowsfrontend74.1
Fix 12 WCAG accessibility violations in checkout formfrontend68.8
Build distributed node cluster with gossip protocolfrom-scratch24.8
Fix auth bypass vulnerabilitydebugging92.3
Add GraphQL layer over REST APImulti-language69.8
Write integration tests for payment flowcode-review41.7
Zero-downtime schema migrationfull-stack48.0
Add rate limiting middlewarebackend66.1
Implement Stripe webhook handlerbackend45.3
Fix flaky test suitedebugging41.0
Add cursor-based pagination to REST APIbackend34.5
Fix N+1 query in dashboardbackend36.8
Fix memory leak in event handlerdebugging56.7
Refactor monolithic handler to CQRSrefactoring31.8
Debug race condition in worker pooldebugging88.5
Fix React hydration mismatchfrontend72.5
Build terminal UI dashboardfrom-scratch28.7
Build REST API from scratchfrom-scratch77.9