Will Claude 3.5 Opus have a higher Chat Arena Elo than GPT-5?
Basic
26
Ṁ1020Jan 2
6%
chance
1D
1W
1M
ALL
Note that if OpenAI ends of changing their naming scheme to something else, I will count it if the model appears to be the one mentioned in this blog post: https://openai.com/index/openai-board-forms-safety-and-security-committee/
Additionally, it will only be resolved yes if 3.5 Opus sustains its position for the entire first month after both models are listed in the leaderboards, so if it passes GPT-5 temporarily due to noise it will not count.
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Related questions
Related questions
Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
32% chance
Will any LLM outrank GPT-4 by 150 Elo in LMSYS chatbot arena before 2025?
19% chance
Will Claude 3.5 Opus be able to draw me in tic-tac-toe while playing as O at least 1/3 of the time?
32% chance
Will any open-source model rank in the top 3 on Chatbot Arena at any point in 2024? (resolves based on ELO rating)
3% chance
What will Claude 3.5 Opus's reported 0-shot performance on GPQA Diamond be upon release?
What will be the *first* ELO Rating of Claude 3.5 Opus in the LMSYS Arena?
Will there be a model with a 69%+ Chatbot Arena win rate against gpt-o1 before June 1st, 2025?
47% chance
Will Claude Opus be ranked in the top 20 on the Chatbot Arena Leaderboard two years from today (3/10/24)?
13% chance
Will Claude 3.5 Opus be available via API by end of 2025?
70% chance
Will an Open Source LLM Surpass any GPT-4 model in Elo Rating on Chatbot Arena on december 31, 2024?
96% chance