Will Claude Opus 4.5 achieve a SOTA score on SWE-rebench when it is first evaluated?
17
1k · Ṁ8697 · resolved Nov 26
Resolved YES
Resolves when Claude Opus 4.5 is evaluated and its score is visible on https://swe-rebench.com/
This question is managed and resolved by Manifold.
🏅 Top traders
| # | Total profit |
|---|---|
| 1 | Ṁ479 |
| 2 | Ṁ156 |
| 3 | Ṁ89 |
| 4 | Ṁ74 |
| 5 | Ṁ59 |
Related questions
BIG-bench accuracy 75% #3: Will SOTA for a single model on BIG-bench pass 75% by the start of 2026?
86% chance
What will be the highest score achieved on SWE-Bench Verified in 2025?
How many parameters does the new possibly-SOTA large language model, Claude 3 Opus, have?
Will Claude 4 achieve over 95% on the MMLU-Pro benchmark by end of 2025?
1% chance
What will be true of the SOTA AI on the FrontierMath benchmark, before 2026?
BIG-bench accuracy 75% #4: Will SOTA for a single model on BIG-bench pass 75% by the start of 2027?
86% chance
BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?
87% chance
When will SOTA for Atari 100k pass human median and mean score on all 57 games?
[Carlini questions] SOTA AI scores better than X% of other participants in competitive programming contest by 2030
95.2
[Carlini questions] SOTA AI scores better than X% of other participants in competitive programming contest by 2027
91.5