
Will o1 score ≥60% on the REBUS benchmark?
5
Ṁ1kṀ1.9kresolved Mar 11
Resolved
YES1H
6H
1D
1W
1M
ALL
Update 2024-22-12 (PST): This market refers to the REBUS benchmark as described in the paper "REBUS: A Benchmark to Evaluate the Rationality of Language Models" (AI summary of creator comment)
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
🏅 Top traders
| # | Trader | Total profit |
|---|---|---|
| 1 | Ṁ510 | |
| 2 | Ṁ101 | |
| 3 | Ṁ83 |
People are also trading
Best 8-hour AI score on RE-Bench >= 0.8 by what year?
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2030?
84% chance
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2027?
61% chance
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2028?
79% chance
Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2027?
7% chance
Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2030?
33% chance
Will OpenAI's o4 get above 50% on humanity's last exam?
16% chance
Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2028?
29% chance
In what year will AI achieve a score of 95% or higher on the PutnamBench leaderboard?
4/6/28
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
10/29/27
Sort by:
@derikk after looking at the examples and not getting any correct and then seeing 83% as the human baseline I felt really bad till I read that humans were allowed to Google and use reverse image search.
People are also trading
Related questions
Best 8-hour AI score on RE-Bench >= 0.8 by what year?
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2030?
84% chance
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2027?
61% chance
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2028?
79% chance
Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2027?
7% chance
Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2030?
33% chance
Will OpenAI's o4 get above 50% on humanity's last exam?
16% chance
Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2028?
29% chance
In what year will AI achieve a score of 95% or higher on the PutnamBench leaderboard?
4/6/28
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
10/29/27