Will Claude Opus 4.5 achieve a SOTA score on SWE-rebench when it is first evaluated?
8
1kṀ6006Dec 31
94%
chance
1H
6H
1D
1W
1M
ALL
Resolves when Claude Opus 4.5 is evaluated and its score is visible on https://swe-rebench.com/
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
People are also trading
Related questions
When will SOTA for Atari 100k pass human median and mean score on all 57 games?
BIG-bench accuracy 75% #3: Will SOTA for a single model on BIG-bench pass 75% by the start of 2026?
86% chance
Will Claude Opus 4.5 exceed 80% on SWE-Bench verified?
YES
BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?
87% chance
What will be the highest score achieved on SWE-Bench Verified in 2025?
Are these Claude Opus 4.5 leaked benchmark scores real?
1% chance
What will be true of the SOTA AI on the FrontierMath benchmark, before 2026?
BIG-bench accuracy 75% #4: Will SOTA for a single model on BIG-bench pass 75% by the start of 2027?
86% chance
How many parameters does the new possibly-SOTA large language model, Claude 3 Opus, have?
Will Claude 4 achieve over 95% on the MMLU-Pro benchmark by end of 2025?
1% chance