Will Claude Opus 4.6 exceed 85% on SWE-bench?
6
Ṁ100Ṁ581in a day
2%
chance
1H
6H
1D
1W
1M
ALL
According to Anthropic (on minimalist scaffolding)
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
People are also trading
Related questions
Will the next Claude Sonnet be better than Claude 4.5 Opus at software engineering?
79% chance
Will Anthropic’s next Sonnet model exceed 65% on terminal bench?
73% chance
Will Anthropic’s next Sonnet model exceed 83% on SWE-bench verified?
50% chance
Will Claude Sonnet 5 exceed 85% on SWE-bench verified?
16% chance
Will Claude powerusers think Sonnet 4.6 is strictly better for everyday use than Opus 4.5, analogously to s4.5 vs o4.1?
40% chance
AI resolves at least X% on SWE-bench without any assistance, by 2028?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?