
Will o1 (not preview) achieve a better score on LiveBench coding than Claude 3.5 Sonnet 10/22?
5
100Ṁ261resolved Dec 14
Resolved
NO1H
6H
1D
1W
1M
ALL
Per LiveBench.ai Claude 3.5 Sonnet achieves 67.13 while o1-preview gets only 50.85.
Resolves when o1 is added to the LiveBench leaderboard
Update 2024-11-12 (PST): Market will resolve based on API results from LiveBench, not manual additions to the leaderboard. (AI summary of creator comment)
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ113 | |
2 | Ṁ17 | |
3 | Ṁ2 |
People are also trading
Related questions
Will the best AI score on the IMO 2025 be more like AlphaProof or o3?
Will GPT-5 perform better than o1 (not preview) at AIME 2024, Codeforces elo, GPQA, or the 2024 ioi?
91% chance
Will I judge GPT-5 to be smarter than o1 (not preview) after both are released?
81% chance
Will an LLM get > 50% on hard problems on LiveCodeBench Pro?
50% chance
Will Claude 3.5 Opus have a higher Chat Arena Elo than GPT-5?
6% chance
Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
9% chance
Will Claude 4 achieve over 95% on the MMLU-Pro benchmark by end of 2025?
12% chance
Will the GPT4+code-interpreter+search score > 1350 on Lmsys Arena Leaderboard?
49% chance