Will an autonomous agent resolve 90% of tasks on SWE-bench by 2025?
Dec 31
25% chance
Resolves "Yes" if, at the time of closure, there is an entry on the SWE-bench leaderboard (https://www.swebench.com/) with a score greater than or equal to 90%.
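As a rough illustration of the criterion above, here is a minimal sketch in Python. It assumes the leaderboard scores have already been collected by hand as percentages; the function name `resolves_yes` and the constant `THRESHOLD` are hypothetical and not part of any SWE-bench or Manifold API.

```python
from typing import Iterable

# Threshold in percent, taken from the resolution criterion.
THRESHOLD = 90.0


def resolves_yes(leaderboard_scores: Iterable[float], threshold: float = THRESHOLD) -> bool:
    """Return True if any leaderboard entry scores at or above the threshold.

    `leaderboard_scores` is assumed to be a hand-collected list of
    percentages; nothing here fetches data from the leaderboard site.
    """
    return any(score >= threshold for score in leaderboard_scores)
```

The comparison is inclusive, so an entry at exactly 90% counts, while an empty leaderboard or a top score just under 90% does not.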
@DavidFWatson That's an excellent question. Let's explore the possibilities:
This could be included in the question, i.e. only the number on the benchmark matters, regardless of whether it was gamed.
I could wait a certain amount of time to check whether any controversy emerges. One month feels safe. The question would then resolve yes if, one month after the deadline, I judge that there is no consensus that the number was gamed. This makes the question more informative.
Related questions
Will any AI be able to formalize >=90% of IMO problems by the start of 2025?
33% chance
AI resolves at least X% on SWE-bench without any assistance, by 2028?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2026?
57% chance
Will an autonomous personal AI agent, capable of managing daily affairs, be available by the end of 2024?
29% chance
AI resolves at least X% on SWE-bench assistance, by 2025?
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2027?
71% chance
Will an AI model outperform 95% of Manifold users on accuracy before 2026?
67% chance
By 2025, will most well-educated people expect AI to within 10 years be better at intellectual work than 99% of humans?
19% chance
Will a smart agent pass our Turing test by the end of 2025?
65% chance