APEX

Tasks

58 benchmark tasks

Fix hallucination and context window bugs in RAG agent

backend · hard · python

Fix deadlocking transaction patterns in Flask app

backend · expert · python

Write complex SQL report with window functions

backend · hard · python

Implement JWT auth middleware

backend · medium · typescript

Add streaming SSE endpoint for LLM chat

backend · medium · typescript

Build LLM evaluation harness with structured grading

backend · expert · python

Build real-time portfolio risk calculator

backend · expert · python

Fix race conditions in order matching engine

backend · expert · python

Build RAG pipeline with vector search

backend · hard · python

Optimize slow Postgres queries in Flask app

backend · hard · python

Fix N+1 query in dashboard

backend · medium · python

Add cursor-based pagination to REST API

backend · medium · typescript

Implement zero-trust API authentication layer

backend · expert · typescript

Add slash commands and moderation to Discord bot

backend · medium · python

Implement Stripe webhook handler

backend · hard · typescript

Add rate limiting middleware

backend · easy · go

Add retry logic and dead letter queue to Python task queue

backend · medium · python

Fix Node.js stream backpressure causing OOM on large files

backend · medium · typescript

Build materialized view refresh pipeline for analytics

backend · hard · python

Add Redis caching layer to Express API

backend · medium · typescript

Implement multi-tenant row-level security in Postgres

backend · expert · typescript

Add file upload with S3 presigned URLs

backend · medium · typescript

Write tests for untested legacy Flask service

code-review · medium · python

Code review: identify security vulns

code-review · medium · javascript

Write integration tests for payment flow

code-review · hard · typescript

Harden insecure Docker setup with 12 vulnerabilities

code-review · medium · typescript

Fix auth bypass vulnerability

debugging · hard · typescript

Fix data integrity bugs in denormalized e-commerce schema

debugging · expert · python

Debug race condition in worker pool

debugging · hard · go

Find and patch all OWASP Top 10 vulnerabilities

debugging · expert · typescript

Find and fix 4 hidden backdoors in Flask app

debugging · hard · python

Fix flaky test suite

debugging · medium · python

Fix broken GitHub Actions CI pipeline

debugging · medium · typescript

Fix memory leak in event handler

debugging · hard · typescript

Debug and fix 6 broken database triggers and constraints

debugging · expert · python

Build distributed node cluster with gossip protocol

from-scratch · hard · go

Build terminal UI dashboard

from-scratch · medium · python

Build REST API from scratch

from-scratch · easy · python

Fix React hydration mismatch

frontend · medium · typescript

Add virtual scrolling to table rendering 5000 rows

frontend · medium · typescript

Fix 12 WCAG accessibility violations in checkout form

frontend · hard · typescript

Optimize bloated React bundle under 500KB

frontend · medium · typescript

Convert React app to PWA with offline support

frontend · medium · typescript

Fix broken responsive layout

frontend · easy · typescript

Add i18n with locale routing to Next.js app

full-stack · hard · typescript

Dockerize Node.js monorepo

full-stack · medium · typescript

Add WebSocket real-time updates

full-stack · hard · typescript

Zero-downtime schema migration

full-stack · hard · python

Write Kubernetes manifests for Node.js microservice

full-stack · medium · typescript

Add Google OAuth2 login to Express app

full-stack · hard · typescript

Add caching layer to eliminate slow SSR page loads

full-stack · medium · typescript

Add GraphQL layer over REST API

multi-language · hard · typescript

Port Python CLI to Rust

multi-language · expert · rust

Remove AI slop and over-engineering from codebase

refactoring · medium · typescript

Refactor monolithic handler to CQRS

refactoring · expert · typescript

Replace console.log with structured logging

refactoring · medium · typescript

Migrate callback-hell Express app to async/await

refactoring · medium · javascript

Split 1100-line god file into proper modules

refactoring · medium · typescript