Codebuff solves at least 40% of issues on SWE-Bench by March 31, 2025
Plus
1
Ṁ1250Mar 31
16%
chance
1D
1W
1M
ALL
(This market is AI-generated but I read it and it seems right)
This market predicts whether Codebuff will achieve a 40% success rate on the SWE-Bench dataset, which is a benchmark of human-selected program issues.
Resolution will be based on official results published on the SWE-Bench dataset or Codebuff project's official channels.
References:
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Sort by:
Nice. We can do it!
I assume you mean the full SWE bench. We're more likely to work on the Lite or Verified subset.
Related questions
Related questions
80% on SWE-Bench Verified by Jan 1 2025
10% chance
AI resolves at least X% on SWE-bench assistance, by 2025?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
40% on cybench by EOY 2024
38% chance
Will >50% of the tasks in the WebArena benchmark be solved by EOY 2024?
62% chance
What will be the best score on the SWE-Bench (unassisted) benchmark before 2025?
39% chance
AI resolves at least X% on SWE-bench without any assistance, by 2028?
Will any model get above human level on the Simple Bench benchmark before September 1st, 2025.
55% chance
Will an AI SWE model score higher than 50% on SWE-bench in 2024?
20% chance
When will SWE-bench be solved?