APEX

Qwen3.5 27b [Q4_K_M]

LM Studio

262K context | <$0.01/M input | <$0.01/M output
1361 (peak 1362)

Avg Score: 63.1
Avg Cost: $0.18
Score/$: 342.2
Runs: 117

[Charts not reproduced: Win/Loss/Draw, Scoring Dimensions, Score Distribution]

Category ELOs

frontend (hard): 1816
debugging (medium): 1815
refactoring (expert): 1635
frontend (expert): 1532
frontend: 1442
frontend (easy): 1425
debugging: 1423
backend (medium): 1421
full-stack (medium): 1420
full-stack: 1395
frontend (medium): 1379
code-review (medium): 1371
code-review: 1361
full-stack (hard): 1361
backend: 1345
backend (hard): 1341
refactoring: 1334
debugging (expert): 1319
from-scratch (easy): 1314
from-scratch (medium): 1303
debugging (hard): 1301
refactoring (medium): 1263
from-scratch: 1229
backend (expert): 1211
code-review (hard): 1175
from-scratch (hard): 1118
multi-language: 1070
backend (easy): 683
multi-language (expert): 535
multi-language (hard): 296
from-scratch (expert): 194

All Results

Task | Category | Score
Implement background job scheduler with persistence | backend | 53.0
Fix data integrity bugs in denormalized e-commerce schema | debugging | 71.8
Build RAG pipeline with vector search | backend | 42.6
Migrate callback-hell Express app to async/await | refactoring | 54.1
Build terminal UI dashboard | from-scratch | 60.4
Build real-time portfolio risk calculator | backend | 56.6
Implement multi-tenant row-level security in Postgres | backend | 40.3
Build production website with auth and members area | frontend | 67.3
Optimize bloated React bundle under 500KB | frontend | 70.8
Fix auth bypass vulnerability | debugging | 28.0
Add file upload with S3 presigned URLs | backend | 80.9
Write Kubernetes manifests for Node.js microservice | full-stack | 81.1
Fix React hydration mismatch | frontend | 73.9
Write tests for untested legacy Flask service | code-review | 50.9
Write complex SQL report with window functions | backend | 50.1
Build CLI tool with subcommands and config | from-scratch | 42.3
Fix N+1 query in dashboard | backend | 55.9
Optimize slow Postgres queries in Flask app | backend | 74.1
Implement Stripe webhook handler | backend | 78.7
Add i18n with locale routing to Next.js app | full-stack | 63.8
Build codebase indexer for LLM context windows | from-scratch | 38.8
Build distributed node cluster with gossip protocol | from-scratch | 37.4
Add streaming SSE endpoint for LLM chat | backend | 85.3
Add retry logic and dead letter queue to Python task queue | backend | 9.6
Add rate limiting middleware | backend | 44.3
Remove AI slop and over-engineering from codebase | refactoring | 78.3
Debug and fix 6 broken database triggers and constraints | debugging | 75.5
Fix flaky test suite | debugging | 89.8
Find and fix 4 hidden backdoors in Flask app | debugging | 78.7
Add slash commands and moderation to Discord bot | backend | 69.8
Build REST API from scratch | from-scratch | 75.1
Write integration tests for payment flow | code-review | 66.0
Harden insecure Docker setup with 12 vulnerabilities | code-review | 83.5
Add caching layer to eliminate slow SSR page loads | full-stack | 82.4
Zero-downtime schema migration | full-stack | 70.5
Fix broken responsive layout | frontend | 71.3
Implement JWT auth middleware | backend | 45.7
Add WebSocket real-time updates | full-stack | 73.9
Build SaaS admin dashboard from scratch | from-scratch | 47.5
Build MCP server for database management | backend | 55.8
Add GraphQL layer over REST API | multi-language | 62.7
Fix hallucination and context window bugs in RAG agent | backend | 63.0
Fix Node.js stream backpressure causing OOM on large files | backend | 79.3
Fix deadlocking transaction patterns in Flask app | backend | 72.8
Implement transformer inference engine with KV cache | from-scratch | 43.0
Replace console.log with structured logging | refactoring | 36.4
Find and patch all OWASP Top 10 vulnerabilities | debugging | 66.3
Add Google OAuth2 login to Express app | full-stack | 66.3
Debug race condition in worker pool | debugging | 82.4
Fix race conditions in order matching engine | backend | 56.4
Build materialized view refresh pipeline for analytics | backend | 74.8
Add Redis caching layer to Express API | backend | 81.7
Add cursor-based pagination to REST API | backend | 47.3
Dockerize Node.js monorepo | full-stack | 66.6
Split 1100-line god file into proper modules | refactoring | 50.3
Fix memory leak in event handler | debugging | 44.9
Fix broken GitHub Actions CI pipeline | debugging | 93.0
Fix 12 WCAG accessibility violations in checkout form | frontend | 83.3
Convert React app to PWA with offline support | frontend | 75.9
Add virtual scrolling to table rendering 5000 rows | frontend | 45.5
Implement zero-trust API authentication layer | backend | 28.0
Port Python CLI to Rust | multi-language | 35.5
Code review: identify security vulns | code-review | 49.1
Add GraphQL layer over REST API | multi-language | 44.5
Migrate callback-hell Express app to async/await | refactoring | 75.4
Implement multi-tenant row-level security in Postgres | backend | 75.8
Optimize bloated React bundle under 500KB | frontend | 71.5
Convert React app to PWA with offline support | frontend | 44.2
Fix broken responsive layout | frontend | 64.7
Dockerize Node.js monorepo | full-stack | 67.9
Harden insecure Docker setup with 12 vulnerabilities | code-review | 89.6
Build codebase indexer for LLM context windows | from-scratch | 35.8
Replace console.log with structured logging | refactoring | 54.2
Find and patch all OWASP Top 10 vulnerabilities | debugging | 70.2
Split 1100-line god file into proper modules | refactoring | 71.9
Implement JWT auth middleware | backend | 77.5
Add caching layer to eliminate slow SSR page loads | full-stack | 81.2
Add i18n with locale routing to Next.js app | full-stack | 67.3
Implement zero-trust API authentication layer | backend | 73.2
Remove AI slop and over-engineering from codebase | refactoring | 80.0
Write Kubernetes manifests for Node.js microservice | full-stack | 84.9
Build distributed node cluster with gossip protocol | from-scratch | 60.0
Build MCP server for database management | backend | 58.8
Build CLI tool with subcommands and config | from-scratch | 41.5
Build production website with auth and members area | frontend | 60.0
Implement background job scheduler with persistence | backend | 20.5
Fix hallucination and context window bugs in RAG agent | backend | 67.8
Build LLM evaluation harness with structured grading | backend | 47.8
Implement transformer inference engine with KV cache | from-scratch | 12.7
Build real-time portfolio risk calculator | backend | 79.0
Fix race conditions in order matching engine | backend | 84.3
Fix data integrity bugs in denormalized e-commerce schema | debugging | 70.5
Build materialized view refresh pipeline for analytics | backend | 52.0
Fix deadlocking transaction patterns in Flask app | backend | 50.6
Debug and fix 6 broken database triggers and constraints | debugging | 47.4
Write complex SQL report with window functions | backend | 58.4
Find and fix 4 hidden backdoors in Flask app | debugging | 91.5
Write tests for untested legacy Flask service | code-review | 60.4
Add Google OAuth2 login to Express app | full-stack | 62.9
Optimize slow Postgres queries in Flask app | backend | 62.9
Add slash commands and moderation to Discord bot | backend | 44.8
Add retry logic and dead letter queue to Python task queue | backend | 68.9
Fix Node.js stream backpressure causing OOM on large files | backend | 70.9
Add virtual scrolling to table rendering 5000 rows | frontend | 78.5
Fix 12 WCAG accessibility violations in checkout form | frontend | 77.8
Fix auth bypass vulnerability | debugging | 94.5
Write integration tests for payment flow | code-review | 60.8
Zero-downtime schema migration | full-stack | 66.5
Add rate limiting middleware | backend | 38.6
Fix flaky test suite | debugging | 91.8
Fix N+1 query in dashboard | backend | 63.4
Fix memory leak in event handler | debugging | 71.5
Refactor monolithic handler to CQRS | refactoring | 67.6
Debug race condition in worker pool | debugging | 81.7
Fix React hydration mismatch | frontend | 76.5
Build terminal UI dashboard | from-scratch | 48.0
Build REST API from scratch | from-scratch | 79.8