🏅 Top traders
| # | Trader | Total profit |
|---|---|---|
| 1 | Ṁ2,029 | |
| 2 | Ṁ745 | |
| 3 | Ṁ687 | |
| 4 | Ṁ242 | |
| 5 | Ṁ197 |
People are also trading
Good argument but seems to me like 4.5 is pretty even with Opus 4.1. If we were forecasting Opus 4.2's METR 50% time horizon it would be one thing, but this model is much smaller (than Opus) and probably not RLed as heavily as GPT-5 has been. It seems like people are very much not universally convinced that s4.5 is as good as codex at coding tasks, and on top of that it has less reasoning-buff and openai-doing-better-at-this-task-historically buff. sonnet 4 to 4.5 requires a 100% jump for this market to resolve yes, and 4.5 is built from sonnet 4, not from opus 4.1, so that seems like the more relevant frame to me. we will see
@Bayesian Sonnet 4 does somewhat worse than you'd guess based on its general performance on SWE, e.g. most people think Sonnet 4 vs GPT-5 is a close call in real-world SWE. The question is whether this "residual underperformance" carries over to Sonnet 4.5. But Sonnets (3.5 to 3.7) have historically led in METR time horizon.
