APEX
Back to models

Kimi K2.6

OpenRouter

262K context$0.73/M input$3.49/M output
1702peak 1703

Avg Score

80.8

Avg Cost

$0.47

Score/$

170.7

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languagehard
2480
refactoringexpert
2458
from-scratchmedium
2341
code-reviewhard
2311
backendeasy
2290
from-scratcheasy
2157
multi-languageexpert
1882
frontendhard
1858
code-reviewmedium
1854
code-review
1853
multi-language
1829
backendhard
1829
from-scratchhard
1820
refactoring
1814
refactoringmedium
1797
debuggingexpert
1779
debuggingmedium
1779
from-scratch
1752
frontendeasy
1748
frontendexpert
1736
backendexpert
1733
full-stackhard
1732
backend
1722
from-scratchexpert
1717
frontendmaster
1715
full-stack
1676
frontend
1670
frontendmedium
1664
backendmedium
1659
debugging
1622
full-stackmedium
1611
debugginghard
1517
backendmaster
1276

All Results

TaskCategoryScore
Add streaming SSE endpoint for LLM chatbackend84.3
Replace console.log with structured loggingrefactoring73.3
Fix and extend Chrome browser extensionfrontend63.6
Migrate Express monolith to modular architecturebackend70.2
Build real-time portfolio risk calculatorbackend41.7
Implement zero-trust API authentication layerbackend81.7
Add file upload with S3 presigned URLsbackend70.3
Write tests for untested legacy Flask servicecode-review88.1
Optimize slow Postgres queries in Flask appbackend87.1
Build production website with auth and members areafrontend70.5
Build SaaS admin dashboard from scratchfrom-scratch62.1
Add slash commands and moderation to Discord botbackend75.7
Fix deadlocking transaction patterns in Flask appbackend86.6
Implement background job scheduler with persistencebackend72.5
Dockerize Node.js monorepofull-stack81.9
Migrate callback-hell Express app to async/awaitrefactoring85.3
Build LLM evaluation harness with structured gradingbackend82.9
Add Redis caching layer to Express APIbackend86.6
Fix race conditions in order matching enginebackend90.9
Remove AI slop and over-engineering from codebaserefactoring86.3
Fix flaky test suitedebugging87.0
Split 1100-line god file into proper modulesrefactoring86.4
Write Kubernetes manifests for Node.js microservicefull-stack86.8
Build MCP server for database managementbackend87.0
Implement JWT auth middlewarebackend82.4
Build CLI tool with subcommands and configfrom-scratch70.5
Add GraphQL layer over REST APImulti-language91.2
Add WebSocket real-time updatesfull-stack79.3
Fix N+1 query in dashboardbackend78.3
Code review: identify security vulnscode-review78.7
Add caching layer to eliminate slow SSR page loadsfull-stack78.3
Build RAG pipeline with vector searchbackend87.5
Build materialized view refresh pipeline for analyticsbackend81.7
Build codebase indexer for LLM context windowsfrom-scratch81.3
Write integration tests for payment flowcode-review84.8
Build distributed node cluster with gossip protocolfrom-scratch81.6
Fix broken GitHub Actions CI pipelinedebugging93.0
Add i18n with locale routing to Next.js appfull-stack82.2
Build multi-tool LLM agent runtimebackend67.3
Fix 12 WCAG accessibility violations in checkout formfrontend84.7
Port Python CLI to Rustmulti-language62.2
Convert React app to PWA with offline supportfrontend81.3
Implement multi-tenant row-level security in Postgresbackend83.2
Find and patch all OWASP Top 10 vulnerabilitiesdebugging80.5
Find and fix 4 hidden backdoors in Flask appdebugging89.3
Fix auth bypass vulnerabilitydebugging92.2
Fix data integrity bugs in denormalized e-commerce schemadebugging88.4
Write complex SQL report with window functionsbackend87.5
Fix hallucination and context window bugs in RAG agentbackend81.8
Fix memory leak in event handlerdebugging52.2
Debug and fix 6 broken database triggers and constraintsdebugging86.8
Build 3D browser game with physics and multiplayer syncfrontend85.0
Optimize bloated React bundle under 500KBfrontend71.4
Implement transformer inference engine with KV cachefrom-scratch80.5
Zero-downtime schema migrationfull-stack81.7
Build interactive data visualization dashboardfrontend79.1
Add retry logic and dead letter queue to Python task queuebackend82.8
Build terminal UI dashboardfrom-scratch77.3
Refactor monolithic handler to CQRSrefactoring82.5
Implement Stripe webhook handlerbackend87.5
Add Google OAuth2 login to Express appfull-stack80.8
Fix React hydration mismatchfrontend84.4
Fix Node.js stream backpressure causing OOM on large filesbackend91.7
Build REST API from scratchfrom-scratch87.3
Harden insecure Docker setup with 12 vulnerabilitiescode-review93.5
Add virtual scrolling to table rendering 5000 rowsfrontend82.4
Fix broken responsive layoutfrontend77.7
Debug race condition in worker pooldebugging84.8
Add rate limiting middlewarebackend86.8
Add cursor-based pagination to REST APIbackend80.9