Tasks
58 benchmark tasks
Fix hallucination and context window bugs in RAG agent
backendhardpython
Fix deadlocking transaction patterns in Flask app
backendexpertpython
Write complex SQL report with window functions
backendhardpython
Implement JWT auth middleware
backendmediumtypescript
Add streaming SSE endpoint for LLM chat
backendmediumtypescript
Build LLM evaluation harness with structured grading
backendexpertpython
Build real-time portfolio risk calculator
backendexpertpython
Fix race conditions in order matching engine
backendexpertpython
Build RAG pipeline with vector search
backendhardpython
Optimize slow Postgres queries in Flask app
backendhardpython
Fix N+1 query in dashboard
backendmediumpython
Add cursor-based pagination to REST API
backendmediumtypescript
Implement zero-trust API authentication layer
backendexperttypescript
Add slash commands and moderation to Discord bot
backendmediumpython
Implement Stripe webhook handler
backendhardtypescript
Add rate limiting middleware
backendeasygo
Add retry logic and dead letter queue to Python task queue
backendmediumpython
Fix Node.js stream backpressure causing OOM on large files
backendmediumtypescript
Build materialized view refresh pipeline for analytics
backendhardpython
Add Redis caching layer to Express API
backendmediumtypescript
Implement multi-tenant row-level security in Postgres
backendexperttypescript
Add file upload with S3 presigned URLs
backendmediumtypescript
Write tests for untested legacy Flask service
code-reviewmediumpython
Code review: identify security vulns
code-reviewmediumjavascript
Write integration tests for payment flow
code-reviewhardtypescript
Harden insecure Docker setup with 12 vulnerabilities
code-reviewmediumtypescript
Fix auth bypass vulnerability
debugginghardtypescript
Fix data integrity bugs in denormalized e-commerce schema
debuggingexpertpython
Debug race condition in worker pool
debugginghardgo
Find and patch all OWASP Top 10 vulnerabilities
debuggingexperttypescript
Find and fix 4 hidden backdoors in Flask app
debugginghardpython
Fix flaky test suite
debuggingmediumpython
Fix broken GitHub Actions CI pipeline
debuggingmediumtypescript
Fix memory leak in event handler
debugginghardtypescript
Debug and fix 6 broken database triggers and constraints
debuggingexpertpython
Build distributed node cluster with gossip protocol
from-scratchhardgo
Build terminal UI dashboard
from-scratchmediumpython
Build REST API from scratch
from-scratcheasypython
Fix React hydration mismatch
frontendmediumtypescript
Add virtual scrolling to table rendering 5000 rows
frontendmediumtypescript
Fix 12 WCAG accessibility violations in checkout form
frontendhardtypescript
Optimize bloated React bundle under 500KB
frontendmediumtypescript
Convert React app to PWA with offline support
frontendmediumtypescript
Fix broken responsive layout
frontendeasytypescript
Add i18n with locale routing to Next.js app
full-stackhardtypescript
Dockerize Node.js monorepo
full-stackmediumtypescript
Add WebSocket real-time updates
full-stackhardtypescript
Zero-downtime schema migration
full-stackhardpython
Write Kubernetes manifests for Node.js microservice
full-stackmediumtypescript
Add Google OAuth2 login to Express app
full-stackhardtypescript
Add caching layer to eliminate slow SSR page loads
full-stackmediumtypescript
Add GraphQL layer over REST API
multi-languagehardtypescript
Port Python CLI to Rust
multi-languageexpertrust
Remove AI slop and over-engineering from codebase
refactoringmediumtypescript
Refactor monolithic handler to CQRS
refactoringexperttypescript
Replace console.log with structured logging
refactoringmediumtypescript
Migrate callback-hell Express app to async/await
refactoringmediumjavascript
Split 1100-line god file into proper modules
refactoringmediumtypescript