Will Anthropic’s next Sonnet model exceed 83% on SWE-bench verified?
8
Ṁ100Ṁ668resolved Feb 17
Resolved
NO1H
6H
1D
1W
1M
ALL
Only counts if the number in the model’s name increments, so a new Claude Sonnet 4.5 checkpoint does not count.
If a new Sonnet model is not released by 2027 this will resolve NA
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
🏅 Top traders
| # | Trader | Total profit |
|---|---|---|
| 1 | Ṁ42 | |
| 2 | Ṁ32 | |
| 3 | Ṁ29 | |
| 4 | Ṁ27 | |
| 5 | Ṁ8 |
Sort by:
@JaundicedBaboon Introducing Sonnet 4.6 \ Anthropic
The official release page has it at 79.6%, which is substantially below 83%.
People are also trading
Related questions
Will Anthropic’s next Sonnet model exceed 65% on terminal bench?
3% chance
Will Claude Sonnet 5 exceed 85% on SWE-bench verified?
36% chance
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
In what year will AI achieve a score of 95% or higher on the SWE-bench Verified benchmark?
2/29/28
Will Anthropic release an open-weights model in 2026?
16% chance
AI resolves at least X% on SWE-bench without any assistance, by 2028?