@firstuserhere seeing a new pfp is so disorienting 😅 and it's nice that you're back
Anyone with access to Devin will be able to test it on SWE-bench, right?
@firstuserhere Do you have any info beyond what was posted on their blog?
"Devin was evaluated on a random 25% subset of the dataset. Devin was unassisted, whereas all other models were assisted (meaning the model was told exactly which files need to be edited)."
- https://www.cognition-labs.com/introducing-devin
This sounds exactly like how they tested GPT-4.
"GPT-4 is evaluated on a random 25% subset of the dataset."
So to me that's valid and fair. The wording on the blog implies Cognition ran the benchmark themselves. I could understand waiting for independent verification, although it might be cost-prohibitive for others to run, in which case we might be waiting forever.
@SIMOROBO Actually, you might be right. I'll read more about it; I made the comment without checking in depth.
@firstuserhere Yeah, I'd love a source for the "only pull requests" claim. My impression was that it's a random 25% subset.
@Nikola The SWE-bench dataset is pull requests. Any random subset is only pull requests.
SWE-bench is a dataset that tests systems' ability to solve GitHub issues automatically. The dataset collects 2,294 Issue-Pull Request pairs from 12 popular Python repositories. Evaluation is performed by unit test verification using post-PR behavior as the reference solution.
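For what it's worth, here's a rough sketch of what "a random 25% subset" of SWE-bench looks like in practice. This assumes the public Hugging Face copy of the dataset and its documented field names; it's not a claim about Cognition's actual harness.

```python
# Rough sketch (not Cognition's harness): sampling a random 25% subset of
# SWE-bench, assuming the public Hugging Face copy of the dataset and its
# documented field names.
from datasets import load_dataset

swe_bench = load_dataset("princeton-nlp/SWE-bench", split="test")  # 2,294 issue-PR pairs

# "Random 25% subset": every instance is an issue-PR pair, so any subset
# is still only pull requests.
subset = swe_bench.shuffle(seed=0).select(range(len(swe_bench) // 4))

for instance in subset:
    issue = instance["problem_statement"]  # the GitHub issue text the agent must resolve
    # Unassisted: the agent sees only the issue and the repo.
    # Assisted: it is also told which files the reference PR edited.
    # Either way, evaluation applies the agent's patch and runs the repo's
    # unit tests, scoring against post-PR behavior as the reference solution.
    ...
```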
From https://www.cognition-labs.com/blog
We evaluated Devin on SWE-bench, a challenging benchmark that asks agents to resolve real-world GitHub issues found in open source projects like Django and scikit-learn.
Devin correctly resolves 13.86%* of the issues end-to-end, far exceeding the previous state-of-the-art of 1.96%. Even when given the exact files to edit, the best previous models can only resolve 4.80% of issues.
We plan to publish a more detailed technical report soon—stay tuned for more details.