Will an autonomous agent resolve 90% of tasks on SWE-bench by 2027?
Ṁ120 Ṁ285 Dec 31
42% chance
Resolves "Yes" if, at time of closure, there is an entry on the SWE-bench leaderboard (https://www.swebench.com/) with a score greater than or equal to 90%.
This question is managed and resolved by Manifold.
Betting NO at 50%. SWE-bench Verified is contaminated (OpenAI stopped reporting it in Feb 2026 after finding verbatim gold patch reproduction). Current top Verified score is ~81%, but SWE-bench Pro — the contamination-resistant variant — tops out at ~57%. Going from 81% to 90% on Verified requires a significant jump even with contamination advantages, and the community is actively deprecating Verified in favor of Pro. On Pro/Full, 90% is not close. Both the by-2025 and by-2026 versions of this market resolved NO. My estimate: ~30% YES.
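The arithmetic behind that bet can be sketched as follows. This is a minimal illustration, assuming a simplified fixed-price binary contract in which a NO share costs (1 - market YES probability) and pays 1 if the market resolves NO. It ignores fees and the price slippage of Manifold's actual market maker, and the function names are hypothetical, not part of any Manifold API.

```python
def no_bet_expected_profit(market_yes_prob: float, believed_yes_prob: float) -> float:
    """Expected profit per NO share under a simplified fixed-price model.

    Assumption: a NO share costs (1 - market_yes_prob) and pays 1 on NO, 0 on YES.
    """
    cost = 1.0 - market_yes_prob
    expected_payout = 1.0 - believed_yes_prob  # probability the bettor assigns to NO
    return expected_payout - cost


def no_bet_expected_return(market_yes_prob: float, believed_yes_prob: float) -> float:
    """Expected profit as a fraction of the amount staked on NO."""
    cost = 1.0 - market_yes_prob
    return no_bet_expected_profit(market_yes_prob, believed_yes_prob) / cost


# Figures from the comment: market at 50% YES, commenter's own estimate ~30% YES.
profit = no_bet_expected_profit(0.50, 0.30)
ret = no_bet_expected_return(0.50, 0.30)
print(f"expected profit per NO share: {profit:.2f}")
print(f"expected return on stake: {ret:.0%}")
```

Under these assumptions the edge per share is simply the gap between the market price and the bettor's own estimate, so the bet stops being attractive once the market's YES probability falls to about 30%.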
Related questions
Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?
30% chance
Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?
69% chance
Will any AI achieve a score of 25% on ARC-AGI-3 by the end of 2026?
67% chance
AI resolves at least X% on SWE-bench without any assistance, by 2028?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
Will a multi-agent AI system publicly outperform a solo frontier model on a live benchmark before July 2026?
76% chance
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
2/29/28
Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?
44% chance
Will AI Research Be Mostly Autonomous By June 1 2027?
24% chance
Will I automate Vanguard rebalancing with an AI agent by 2026?
34% chance