
Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on Simple Bench?
11
100Ṁ522resolved Feb 5
Resolved
NO1D
1W
1M
ALL
https://simple-bench.com/ Claude 3.5 Sonnet 10/22 achieves 41.4% whereas the best Gemini model scores 27.1%
Update 2025-22-01 (PST): - Resolution Date: The market will now be resolved on February 1st, 2025 instead of the previously stated date. (AI summary of creator comment)
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ116 | |
2 | Ṁ46 | |
3 | Ṁ27 |
Sort by:
Related questions
Related questions
What will Claude 3.5 Opus's reported 0-shot performance on GPQA Diamond be upon release?
Will any model get above human level on the Simple Bench benchmark before September 1st, 2025.
45% chance
How long until one of Gemini, Claude, etc... match the capabilities of O1?
Will Gemini 2 be released before EOY 2025?
97% chance
What will be the best score on Cybench by December 31st 2025?
Will "Gemini [Ultra, 1.0] smash GPT-4 by 5x"?
18% chance
What will be true of Gemini 2?