
Codebuff solves at least 40% of issues on SWE-Bench by March 31, 2025
Resolved NO on Apr 10
(This market is AI-generated but I read it and it seems right)
This market predicts whether Codebuff will resolve at least 40% of issues on the SWE-Bench dataset, a benchmark built from real GitHub issues in open-source Python repositories, by March 31, 2025.
Resolution will be based on official results published for the SWE-Bench dataset or through the Codebuff project's official channels.
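For a rough sense of what "at least 40%" means in absolute terms, here is a minimal sketch of the threshold check. It assumes the commonly cited split sizes (2,294 tasks in the full test set, 300 in Lite, 500 in Verified). The function names are hypothetical and are not part of any SWE-Bench or Codebuff tooling. The market text does not say which split counts, so the split is left as a parameter.

```python
# Assumed task counts for the SWE-bench splits (not stated in the market text).
SPLIT_SIZES = {"full": 2294, "lite": 300, "verified": 500}
THRESHOLD_PERCENT = 40  # "at least 40%" from the market title


def min_resolved_needed(split: str = "full") -> int:
    """Smallest number of resolved tasks that reaches the threshold."""
    total = SPLIT_SIZES[split]
    # Integer ceiling division avoids floating-point rounding at the boundary.
    return (THRESHOLD_PERCENT * total + 99) // 100


def meets_threshold(resolved: int, split: str = "full") -> bool:
    """Return True if resolved / total is at least 40% on the given split."""
    total = SPLIT_SIZES[split]
    if not 0 <= resolved <= total:
        raise ValueError(f"resolved must be between 0 and {total}")
    return resolved * 100 >= THRESHOLD_PERCENT * total


if __name__ == "__main__":
    for name in SPLIT_SIZES:
        print(name, min_resolved_needed(name))
```

Under these assumptions the bar is 918 resolved tasks on the full set, 120 on Lite, and 200 on Verified.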
This question is managed and resolved by Manifold.
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | | Ṁ555 |
2 | | Ṁ1 |
People are also trading
Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?
36% chance
Top SWE-Bench Verified score in 2025?
85.0
AI resolves at least X% on SWE-bench without any assistance, by 2028?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
What will be the best performance on SWE-bench Verified by December 31st 2025?
When will SWE-bench be solved?
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2026?
50% chance
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
12/5/27
Will rw_search be able to replace >50% of mathlib proofs by 2025-11-26?
19% chance
What will be the highest score achieved on SWE-Bench Verified in 2025?
Nice. We can do it!
I assume you mean the full SWE bench. We're more likely to work on the Lite or Verified subset.