What will be true of the first model to cross 1400 on lmarena.ai?


resolved Mar 1

Gemini Exp: Resolved

ChatGPT 4o: Resolved

Gemini 2.0: Resolved

Claude 3.5 Opus: Resolved

Claude 4: Resolved

Grok: Resolved YES

OpenAI model code named Orion: Resolved

GPT 5: Resolved

Will resolve once a model stays at or above 1400 for a week and, at the end of that week, has a 95% CI with a lower bound of at least 1395. (These criteria are somewhat arbitrary; they are meant to ensure the score is based on a sufficient number of votes.)

Will resolve N/A if lmarena.ai changes its scoring significantly enough that a current model passes 1400.
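The two conditions above can be sketched as a small check. This is only an illustration, not how the market creator actually evaluated resolution: it assumes "a week" means seven consecutive daily leaderboard snapshots, and the `Snapshot` type, the `qualifies` function, and the idea of having one snapshot per day are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date, timedelta

THRESHOLD = 1400   # score the model must hold all week
CI_FLOOR = 1395    # minimum 95% CI lower bound at the end of the week
WINDOW_DAYS = 7    # assumption: "a week" = 7 consecutive daily snapshots


@dataclass
class Snapshot:
    """One hypothetical daily leaderboard reading for a single model."""
    day: date
    score: float
    ci_lower: float  # lower bound of the 95% confidence interval


def qualifies(snapshots, start):
    """Return True if both criteria hold for the week beginning on `start`."""
    by_day = {s.day: s for s in snapshots}
    week = [by_day.get(start + timedelta(days=i)) for i in range(WINDOW_DAYS)]
    if any(s is None for s in week):
        return False  # a missing day means the week cannot be confirmed
    if any(s.score < THRESHOLD for s in week):
        return False  # the score dipped below the threshold during the week
    return week[-1].ci_lower >= CI_FLOOR  # CI is checked only on the last day
```

Note that the confidence-interval condition applies only to the final day, so a model with a wide interval early in the week can still qualify if enough votes arrive by the end.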

Current rankings (11/22/24):

Gemini Exp 1121: 1365
ChatGPT 4o Latest (2024-11-20): 1360
Gemini Exp 1114: 1343
o1 preview: 1334
o1 mini: 1308
Gemini 1.5 Pro-002: 1301
Grok 2 0813: 1289
Yi Lightning: 1287
GPT 4o 2024-05-13: 1285
Claude 3.5 Sonnet (20241022): 1282
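For a rough sense of how far these scores are from the threshold, here is a short sketch. It assumes the arena score behaves like a standard Elo rating on a 400-point, base-10 scale; the function names are hypothetical, and the three scores are copied from the list above.

```python
THRESHOLD = 1400

# Top three scores from the 11/22/24 rankings above.
rankings = {
    "Gemini Exp 1121": 1365,
    "ChatGPT 4o Latest (2024-11-20)": 1360,
    "Gemini Exp 1114": 1343,
}


def points_needed(score, threshold=THRESHOLD):
    """Points a model still has to gain to reach the threshold."""
    return max(0, threshold - score)


def expected_win_rate(rating_gap):
    """Standard Elo expected score for the higher-rated side of a given gap."""
    return 1.0 / (1.0 + 10 ** (-rating_gap / 400))


for name, score in rankings.items():
    gap = points_needed(score)
    print(f"{name}: {gap} points short; a model at {THRESHOLD} "
          f"would be expected to win {expected_win_rate(gap):.1%} of head-to-heads")
```

Under that assumption, the 35-point gap between the top model and 1400 corresponds to an expected head-to-head win rate of roughly 55% for the 1400-rated model.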

Update 2025-01-24 (PST): If a Deepseek model is first to cross 1400, all options will resolve NO. (AI summary of creator comment)

Resolution criteria

"stays at or above 1400 for a week and has a 95% CI with a lower bound of at least 1395 at the end of that week"

Early Grok 3 is over 1400 as of 02-16, so it will need to maintain that rating through 02-22 for this to resolve YES.


If a Deepseek model is first to cross 1400, all of these will resolve NO

@ChinmayTheMathGuy also o3
