APEX TESTING_
Find out which AI coding models actually deliver and which are just hype.
by HauhauCS
Models Tested
52
Tasks
70
Total Runs
5470
Avg Score
66.4
Capital Spent
$5335.05
Top Models
View full leaderboard →| # | Model | ELO |
|---|---|---|
| 1 | Claude Sonnet 4.6 | 1862 |
| 2 | Claude Opus 4.6 | 1862 |
| 3 | GPT 5.2 | 1853 |
| 4 | GPT 5.3 Codex | 1829 |
| 5 | Claude Opus 4.5 | 1806 |
Recent Activity
No completed runs yet