Will Claude 3.7 exceed Claude 3.5 new in LiveBench on the minimum reasoning effort setting?
Resolved YES on Feb 24
Claude 3.7 is rumored to be Anthropic's upcoming model, which can switch between deep thinking like o1 and instant responses like a conventional LLM using the same set of weights. Resolves YES if Claude 3.7 exceeds Claude 3.5 new on LiveBench using its instant response setting, and NO otherwise.
Resolves N/A if the model turns out not to have an instant response mode at all.
This question is managed and resolved by Manifold.
🏅 Top traders
# | Total profit |
---|---|
1 | Ṁ257 |
2 | Ṁ200 |
3 | Ṁ79 |
4 | Ṁ38 |
5 | Ṁ26 |
Related questions
What will Claude 3.5 Opus's reported 0-shot performance on GPQA Diamond be upon release?
What will Claude 3.7 Sonnet's reported 0-shot performance on GPQA Diamond be upon release?
Which will be released first: Claude 3.5 Opus or Claude 4.0 Sonnet?
Will Claude 3.5 Opus be available via API by end of 2025?
38% chance
Will Claude 3.5 Opus have a higher Chat Arena Elo than GPT-5?
6% chance
Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
10% chance
Does Claude 3.5 have control vector(s) to increase its capabilities?
33% chance
What will be Claude 4's maximum context length when released?
What will be the *first* ELO Rating of Claude 3.5 Opus in the LMSYS Arena?
Will Claude MCP have equivalent functionality to a Claude Computer Use module by EOY2025?
57% chance