APEX
GLM 4.7 [Q4_K_XL]

LM Studio

200K context, $0.60/M input, $2.20/M output
1507 (peak 1526)

Avg Score: 70.6
Avg Cost: $0.04
Score/$: 2002.5
Runs: 41

[Charts: Win/Loss/Draw, Scoring Dimensions, Score Distribution]

Category ELOs

Category        Difficulty  ELO
from-scratch    expert      2072
multi-language  hard        1918
refactoring     expert      1822
backend         easy        1821
from-scratch    easy        1819
multi-language  -           1767
frontend        expert      1716
from-scratch    -           1690
from-scratch    hard        1678
from-scratch    medium      1669
refactoring     -           1613
frontend        hard        1590
frontend        -           1526
backend         hard        1501
backend         expert      1500
backend         -           1498
frontend        medium      1488
debugging       hard        1474
backend         medium      1473
full-stack      -           1439
full-stack      hard        1439
debugging       medium      1432
debugging       -           1394
code-review     -           1351
code-review     medium      1341
debugging       expert      1207
code-review     hard        966

All Results

Task                                                        Category        Score
Build production website with auth and members area         frontend        67.5
Fix data integrity bugs in denormalized e-commerce schema   debugging       63.4
Build LLM evaluation harness with structured grading        backend         74.4
Build MCP server for database management                    backend         57.4
Implement background job scheduler with persistence         backend         73.1
Fix hallucination and context window bugs in RAG agent      backend         64.4
Build SaaS admin dashboard from scratch                     from-scratch    73.6
Implement transformer inference engine with KV cache        from-scratch    87.5
Build real-time portfolio risk calculator                   backend         58.9
Build CLI tool with subcommands and config                  from-scratch    47.6
Fix race conditions in order matching engine                backend         80.1
Fix deadlocking transaction patterns in Flask app           backend         62.4
Debug and fix 6 broken database triggers and constraints    debugging       68.1
Write complex SQL report with window functions              backend         71.5
Find and fix 4 hidden backdoors in Flask app                debugging       88.5
Add Redis caching layer to Express API                      backend         50.9
Write tests for untested legacy Flask service               code-review     54.5
Add Google OAuth2 login to Express app                      full-stack      79.8
Optimize slow Postgres queries in Flask app                 backend         81.2
Add slash commands and moderation to Discord bot            backend         68.7
Add retry logic and dead letter queue to Python task queue  backend         51.2
Fix Node.js stream backpressure causing OOM on large files  backend         89.8
Add virtual scrolling to table rendering 5000 rows          frontend        82.2
Fix 12 WCAG accessibility violations in checkout form       frontend        80.0
Build distributed node cluster with gossip protocol         from-scratch    68.6
Fix auth bypass vulnerability                               debugging       88.5
Add GraphQL layer over REST API                             multi-language  77.5
Write integration tests for payment flow                    code-review     58.5
Zero-downtime schema migration                              full-stack      61.8
Add rate limiting middleware                                backend         79.0
Implement Stripe webhook handler                            backend         70.5
Fix flaky test suite                                        debugging       73.9
Add cursor-based pagination to REST API                     backend         82.9
Fix N+1 query in dashboard                                  backend         75.8
Fix memory leak in event handler                            debugging       61.1
Code review: identify security vulns                        code-review     49.0
Refactor monolithic handler to CQRS                         refactoring     68.8
Debug race condition in worker pool                         debugging       85.0
Fix React hydration mismatch                                frontend        67.5
Build terminal UI dashboard                                 from-scratch    63.0
Build REST API from scratch                                 from-scratch    86.2
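
The summary figures at the top of the page can be cross-checked against the rows above. The following is a minimal sketch, not the site's own method: it assumes Avg Score is a plain arithmetic mean over all runs, and that Score/$ is Avg Score divided by Avg Cost. The page states neither formula, and the variable names are illustrative.

```python
from statistics import mean

# Scores copied from the All Results table, in row order.
scores = [
    67.5, 63.4, 74.4, 57.4, 73.1, 64.4, 73.6, 87.5, 58.9, 47.6,
    80.1, 62.4, 68.1, 71.5, 88.5, 50.9, 54.5, 79.8, 81.2, 68.7,
    51.2, 89.8, 82.2, 80.0, 68.6, 88.5, 77.5, 58.5, 61.8, 79.0,
    70.5, 73.9, 82.9, 75.8, 61.1, 49.0, 68.8, 85.0, 67.5, 63.0,
    86.2,
]

run_count = len(scores)   # the page reports Runs: 41
avg_score = mean(scores)  # the page reports Avg Score: 70.6

# ASSUMPTION: Score/$ = Avg Score / Avg Cost. Under that reading, the
# reported Score/$ implies an unrounded average cost per run.
reported_score_per_dollar = 2002.5
implied_avg_cost = avg_score / reported_score_per_dollar

print(f"runs: {run_count}")
print(f"avg score: {avg_score:.1f}")
print(f"implied avg cost: ${implied_avg_cost:.4f}")
```

Under these assumptions the mean rounds to 70.6 across 41 rows, matching the header. The implied cost is about $0.035 per run, which would display as $0.04 at two decimals. That would explain why 70.6 / 0.04 does not equal 2002.5, though the site may compute Score/$ differently (for example, as a mean of per-run ratios).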