
Will o1 (not preview) achieve a better score on LiveBench coding than Claude 3.5 Sonnet 10/22?
5
100Ṁ261resolved Dec 14
Resolved
NO1D
1W
1M
ALL
Per LiveBench.ai Claude 3.5 Sonnet achieves 67.13 while o1-preview gets only 50.85.
Resolves when o1 is added to the LiveBench leaderboard
Update 2024-11-12 (PST): Market will resolve based on API results from LiveBench, not manual additions to the leaderboard. (AI summary of creator comment)
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ113 | |
2 | Ṁ17 | |
3 | Ṁ2 |
People are also trading
Related questions
Will the best AI score on the IMO 2025 be more like AlphaProof or o3?
Will I judge GPT-5 to be smarter than o1 (not preview) after both are released?
81% chance
What will Claude 3.5 Opus's reported 0-shot performance on GPQA Diamond be upon release?
Will GPT-5 perform better than o1 (not preview) at AIME 2024, Codeforces elo, GPQA, or the 2024 ioi?
91% chance
Which will be released first: Claude 3.5 Opus or Claude 4.0 Sonnet?
Will Claude 3.5 Opus have a higher Chat Arena Elo than GPT-5?
6% chance
Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
9% chance
Will the GPT4+code-interpreter+search score > 1350 on Lmsys Arena Leaderboard?
49% chance
What will be the *first* ELO Rating of Claude 3.5 Opus in the LMSYS Arena?
Will GPT-5 score higher than 1350 on the Lmsys Arena Leaderboard
95% chance