Each option is the upper bound of the date range. All true options resolve YES (i.e. if it happens in 2027, then 2028 and 2029 resolve YES).
RE-bench (paper, github) is a benchmark of ML research engineering tasks. By what year will any AI achieve an average normalized score >= 0.8 within an 8 hour window? Note that the paper contains several variants of the benchmark, this question is specifically about the metric in Figure 5 of the paper. Note that 0.8 looks to be about peak human performance (within their evaluator set, at least).
A YES resolution requires this specific metric, not any of the others in the paper. Meaning that even if it is blindingly obvious that an AI can do this, I will not resolve YES until it actually does. Since RE-bench is open source I will run it myself in that scenario (assuming I can get access to the model).
If there are minor updates to RE-bench (e.g. a version 2 that includes a few additional questions) then I will accept results on the updated version OR the original version.
Best 8-hour AI score on RE-Bench >= 0.8 by what year?
2
Ṁ125Ṁ922030
60%
2026
66%
2027
74%
2028
76%
2029
83%
2030
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?
76% chance
In what year will AI achieve a score of 95% or higher on the GPQA benchmark?
11/18/26
In what year will AI achieve a score of 85% or higher on the SimpleBench leaderboard?
7/20/32
In what year will AI achieve a score of 95% or higher on the PutnamBench leaderboard?
7/1/28
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
12/10/26
In what year will AI achieve a score of 95% or higher on the PhysBench leaderboard?
2036
What year will the first AI exceed 80% on MLE-bench?
Chatbot Arena: How high will AI score in 2026?
Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?
45% chance
Will any AI model score above 95% on ARC-AGI-2 by end of 2026?
72% chance