APEX
Claude Opus 4.7

Anthropic

200K context | $15.00/M input | $75.00/M output
1903

Avg Score: 88.3
Avg Cost: $1.52
Score/$: 58.3
Runs: 70

Category ELOs

Category | Difficulty | ELO
from-scratch | medium | 3237
multi-language | expert | 3167
frontend | expert | 2916
refactoring | expert | 2892
frontend | easy | 2671
code-review | hard | 2446
from-scratch | easy | 2374
multi-language | hard | 2309
from-scratch | hard | 2264
frontend | hard | 2255
backend | easy | 2223
from-scratch | expert | 2169
backend | master | 2126
backend | expert | 2122
code-review | medium | 2117
from-scratch | - | 2092
code-review | - | 2066
multi-language | - | 2065
frontend | master | 2025
frontend | - | 2023
refactoring | - | 2012
refactoring | medium | 1995
full-stack | hard | 1990
full-stack | medium | 1963
full-stack | - | 1962
frontend | medium | 1959
backend | hard | 1941
debugging | expert | 1872
backend | - | 1868
debugging | medium | 1824
debugging | - | 1765
backend | medium | 1736
debugging | hard | 1734

All Results

Task | Category | Score
Add streaming SSE endpoint for LLM chat | backend | 80.1
Implement JWT auth middleware | backend | 76.7
Migrate Express monolith to modular architecture | backend | 90.9
Refactor monolithic handler to CQRS | refactoring | 89.7
Fix memory leak in event handler | debugging | 90.9
Fix and extend Chrome browser extension | frontend | 84.8
Build multi-tool LLM agent runtime | backend | 90.9
Build interactive data visualization dashboard | frontend | 87.2
Fix hallucination and context window bugs in RAG agent | backend | 88.9
Find and patch all OWASP Top 10 vulnerabilities | debugging | 91.3
Implement zero-trust API authentication layer | backend | 88.5
Add caching layer to eliminate slow SSR page loads | full-stack | 89.3
Find and fix 4 hidden backdoors in Flask app | debugging | 91.8
Zero-downtime schema migration | full-stack | 89.2
Build MCP server for database management | backend | 90.9
Build terminal UI dashboard | from-scratch | 87.8
Fix N+1 query in dashboard | backend | 89.1
Write tests for untested legacy Flask service | code-review | 89.7
Fix auth bypass vulnerability | debugging | 93.7
Add i18n with locale routing to Next.js app | full-stack | 88.2
Code review: identify security vulns | code-review | 91.3
Add retry logic and dead letter queue to Python task queue | backend | 87.2
Fix broken responsive layout | frontend | 91.8
Build REST API from scratch | from-scratch | 93.3
Add Google OAuth2 login to Express app | full-stack | 84.8
Fix race conditions in order matching engine | backend | 92.5
Build distributed node cluster with gossip protocol | from-scratch | 82.9
Add file upload with S3 presigned URLs | backend | 87.5
Build real-time portfolio risk calculator | backend | 86.1
Build LLM evaluation harness with structured grading | backend | 87.7
Fix 12 WCAG accessibility violations in checkout form | frontend | 91.3
Optimize bloated React bundle under 500KB | frontend | 87.8
Optimize slow Postgres queries in Flask app | backend | 92.5
Fix Node.js stream backpressure causing OOM on large files | backend | 81.3
Build materialized view refresh pipeline for analytics | backend | 88.3
Implement transformer inference engine with KV cache | from-scratch | 87.7
Add rate limiting middleware | backend | 85.6
Build RAG pipeline with vector search | backend | 86.6
Implement multi-tenant row-level security in Postgres | backend | 86.5
Build SaaS admin dashboard from scratch | from-scratch | 86.8
Implement background job scheduler with persistence | backend | 86.6
Replace console.log with structured logging | refactoring | 91.8
Write Kubernetes manifests for Node.js microservice | full-stack | 94.8
Remove AI slop and over-engineering from codebase | refactoring | 89.4
Add GraphQL layer over REST API | multi-language | 87.4
Convert React app to PWA with offline support | frontend | 87.0
Implement Stripe webhook handler | backend | 77.3
Fix data integrity bugs in denormalized e-commerce schema | debugging | 90.9
Fix broken GitHub Actions CI pipeline | debugging | 93.7
Add WebSocket real-time updates | full-stack | 88.2
Build 3D browser game with physics and multiplayer sync | frontend | 84.5
Build production website with auth and members area | frontend | 86.5
Debug and fix 6 broken database triggers and constraints | debugging | 87.0
Migrate callback-hell Express app to async/await | refactoring | 90.9
Add Redis caching layer to Express API | backend | 83.1
Split 1100-line god file into proper modules | refactoring | 89.0
Write integration tests for payment flow | code-review | 88.3
Build CLI tool with subcommands and config | from-scratch | 89.3
Port Python CLI to Rust | multi-language | 89.3
Build codebase indexer for LLM context windows | from-scratch | 86.1
Fix deadlocking transaction patterns in Flask app | backend | 91.8
Add virtual scrolling to table rendering 5000 rows | frontend | 86.5
Write complex SQL report with window functions | backend | 91.3
Add slash commands and moderation to Discord bot | backend | 87.8
Fix React hydration mismatch | frontend | 87.5
Fix flaky test suite | debugging | 89.5
Dockerize Node.js monorepo | full-stack | 83.9
Add cursor-based pagination to REST API | backend | 87.0
Harden insecure Docker setup with 12 vulnerabilities | code-review | 94.6
Debug race condition in worker pool | debugging | 91.3