
Will o1 score ≥60% on the REBUS benchmark?
5
1kṀ1905resolved Mar 11
Resolved
YES1H
6H
1D
1W
1M
ALL
Update 2024-22-12 (PST): This market refers to the REBUS benchmark as described in the paper "REBUS: A Benchmark to Evaluate the Rationality of Language Models" (AI summary of creator comment)
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ510 | |
2 | Ṁ101 | |
3 | Ṁ83 |
People are also trading
Related questions
What will be the best normalized score achieved on the original 7 RE-Bench tasks by December 31st 2025?
Will any model get above human level on the Simple Bench benchmark before September 1st, 2025.
26% chance
Will an AI score over 80% on FrontierMath Benchmark in 2025
10% chance
Will an LLM get > 50% on hard problems on LiveCodeBench Pro?
50% chance
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
10/7/28
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2028?
75% chance
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2030?
77% chance
Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2027?
68% chance
Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2027?
45% chance
Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2030?
60% chance