Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?

Ṁ100Ṁ5.5k

resolved Aug 1

Resolved as

ALL

OpenAI's best released model could be GPT-4, GPT-4o, or something else. It does not count as an OpenAI model unless it's made available to the public to try, and is known to be from OpenAI (e.g. the model can not be a secret, pseudonymous release). If arena.lmsys.org is not available at the time, the successor site or most similar leaderboard will be used.

Resolves yes if Claude 3.5 Opus is ranked above all OpenAI models 1 week after it is put on the leaderboard.

Update 2025-01-01 (PST) (AI summary of creator comment): - Models must be listed on lmarena to be counted.
- Examples:
- o1 pro does not count since it's not on the arena.
- Regular o1 does count.

Market context

Technical AI Timelines

OpenAI

Anthropic

LLMs

Get

1,000

to start trading!

🏅 Top traders

#	Trader	Total profit
1		Ṁ1,066
2		Ṁ247
3		Ṁ130
4		Ṁ96
5		Ṁ61

2 Comments

48 Holders

107 Trades

Sort by:

This is resolved to "nearly no" as it's not really fair to the question to resolve NA (Opus 4 did not beat o3), but technically, Opus "3.5" was never released (with that name).

If the model is not on lmarena, then it will not count. For example, o1 pro does not count now since it's not on the arena. Regular o1 does count.

People are also trading

Will GPT-5.4 outperform Claude Opus 4.6 at METR 50% time horizon?

34% chance

OpenAI beats Grok on arena.ai before June?

70% chance

Will OpenAI ever top the LMArena leaderboard again before 2030?

87% chance

🏅 Top traders

People are also trading

Related questions