
Will o1 (not preview) achieve a better score on LiveBench coding than Claude 3.5 Sonnet 10/22?
5
Ṁ100Ṁ261resolved Dec 14
Resolved
NO1H
6H
1D
1W
1M
ALL
Per LiveBench.ai Claude 3.5 Sonnet achieves 67.13 while o1-preview gets only 50.85.
Resolves when o1 is added to the LiveBench leaderboard
Update 2024-11-12 (PST): Market will resolve based on API results from LiveBench, not manual additions to the leaderboard. (AI summary of creator comment)
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
🏅 Top traders
| # | Trader | Total profit |
|---|---|---|
| 1 | Ṁ113 | |
| 2 | Ṁ17 | |
| 3 | Ṁ2 |