Will Claude Opus 4.5 achieve a SOTA score on SWE-rebench when it is first evaluated?
1k · Ṁ250 · Dec 31
61% chance
Resolves when Claude Opus 4.5 is evaluated and its score is visible on https://swe-rebench.com/
This question is managed and resolved by Manifold.
Related questions
Will Claude Opus 4.5 exceed 80% on SWE-Bench verified?
60% chance
Claude Opus 4.5 released before 2026?
96% chance
Are these Claude Opus 4.5 leaked benchmark scores real?
2% chance
BIG-bench accuracy 75% #3: Will SOTA for a single model on BIG-bench pass 75% by the start of 2026?
86% chance
What will be the highest score achieved on SWE-Bench Verified in 2025?
How many parameters does the new possibly-SOTA large language model, Claude 3 Opus, have?
Will Claude 4 achieve over 95% on the MMLU-Pro benchmark by end of 2025?
1% chance
What will be true of the SOTA AI on the FrontierMath benchmark, before 2026?
BIG-bench accuracy 75% #4: Will SOTA for a single model on BIG-bench pass 75% by the start of 2027?
86% chance
BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?
87% chance