APEX
Back to models

GLM 4.7 [Q4_K_XL]

LM Studio

200K context$0.60/M input$2.20/M output
1476peak 1477

Avg Score

71.2

Avg Cost

$0.04

Score/$

2021.2

Runs

41

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratchexpert
2157
from-scratcheasy
2128
frontendexpert
1956
backendeasy
1951
from-scratchmedium
1920
multi-languagehard
1746
refactoringexpert
1733
from-scratch
1695
from-scratchhard
1642
frontendhard
1630
frontend
1525
refactoring
1521
multi-language
1510
debugginghard
1502
backend
1469
backendmedium
1464
frontendmedium
1456
backendhard
1453
full-stackhard
1447
full-stack
1446
backendexpert
1423
debugging
1363
code-reviewmedium
1299
code-review
1271
debuggingmedium
1264
debuggingexpert
1111
code-reviewhard
656

All Results

TaskCategoryScore
Build production website with auth and members areafrontend72.5
Fix data integrity bugs in denormalized e-commerce schemadebugging63.4
Build LLM evaluation harness with structured gradingbackend74.4
Build MCP server for database managementbackend57.4
Implement background job scheduler with persistencebackend73.1
Fix hallucination and context window bugs in RAG agentbackend64.4
Build SaaS admin dashboard from scratchfrom-scratch73.6
Implement transformer inference engine with KV cachefrom-scratch87.5
Build real-time portfolio risk calculatorbackend58.9
Build CLI tool with subcommands and configfrom-scratch57.2
Fix race conditions in order matching enginebackend80.1
Fix deadlocking transaction patterns in Flask appbackend62.4
Debug and fix 6 broken database triggers and constraintsdebugging68.1
Write complex SQL report with window functionsbackend71.5
Find and fix 4 hidden backdoors in Flask appdebugging88.5
Add Redis caching layer to Express APIbackend50.9
Write tests for untested legacy Flask servicecode-review54.5
Add Google OAuth2 login to Express appfull-stack79.8
Optimize slow Postgres queries in Flask appbackend81.2
Add slash commands and moderation to Discord botbackend68.7
Add retry logic and dead letter queue to Python task queuebackend51.2
Fix Node.js stream backpressure causing OOM on large filesbackend89.8
Add virtual scrolling to table rendering 5000 rowsfrontend82.2
Fix 12 WCAG accessibility violations in checkout formfrontend80.0
Build distributed node cluster with gossip protocolfrom-scratch74.5
Fix auth bypass vulnerabilitydebugging88.5
Add GraphQL layer over REST APImulti-language77.5
Write integration tests for payment flowcode-review58.5
Zero-downtime schema migrationfull-stack61.8
Add rate limiting middlewarebackend79.0
Implement Stripe webhook handlerbackend70.5
Fix flaky test suitedebugging73.9
Add cursor-based pagination to REST APIbackend82.9
Fix N+1 query in dashboardbackend75.8
Fix memory leak in event handlerdebugging61.1
Code review: identify security vulnscode-review49.0
Refactor monolithic handler to CQRSrefactoring68.8
Debug race condition in worker pooldebugging85.0
Fix React hydration mismatchfrontend67.5
Build terminal UI dashboardfrom-scratch68.6
Build REST API from scratchfrom-scratch87.0