Will an autonomous agent resolve 90% of tasks on SWE-bench by 2026?
Basic
6
Ṁ1452025
70%
chance
1D
1W
1M
ALL
Resolves "Yes" if, at time of closure, there is an entry on the SWE-bench leaderboard (https://www.swebench.com/) with score greater or equal to 90%.
Linked Questions:
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Related questions
Related questions
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2025?
18% chance
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2027?
72% chance
AI resolves at least X% on SWE-bench assistance, by 2025?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
Will an AI agent system be able to score at least 40% on level 3 tasks in the GAIA benchmark before 2025.
48% chance
Will >50% of the tasks in the WebArena benchmark be solved by EOY 2024?
62% chance
Will an AI SWE model score higher than 50% on SWE-bench in 2024?
20% chance
Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?
38% chance
AI resolves at least X% on SWE-bench without any assistance, by 2028?
Will a smart agent pass our Turing test by the end of 2025?
59% chance