Will an LLM break 1400 ELO on LMSys before February?
Premium
8
Ṁ1899Feb 2
57%
chance
1D
1W
1M
ALL
Google currently leads with Gemini -- which has two models at around 1370
But OpenAI just announced O3 -- which is getting great marks on things like hard science questions.
https://deepnewz.com/ai-modeling/openai-unveils-o3-o3-mini-models-exceeding-human-performance-on-arc-agi-4f05e4f7
The resolution is simple. Will and LMSys update contain a model with 1400 ELO? Cutoff is last day in January (East Coast time).
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Sort by:
Related questions
Related questions
Will an LLM be able to solve a Rubik's Cube by 2025?
25% chance
Which organization will have the top LLM on LMSys on March 1st?
Will any LLM outrank GPT-4 by 150 Elo in LMSYS chatbot arena before 2025?
19% chance
What organization will have the highest ELO score in the LMSYS Org Chatbot Arena Leaderboard at the end of Dec, 2024?
EOY 2025: Will open LLMs match closed-source LLMs on coding to within 50 ELO points?
36% chance
Llama 3 405B ELO on Lmsys Arena Leaderboard 2 weeks after first appearance?
Will AIs stay below 1453 elo in 2024 on chat.lmsys.org/?leaderboard as predicted by Gary Marcus?
95% chance
Will LLMs mostly overcome the Reversal Curse by the end of 2025?
67% chance
Will another LLM prompting challenge with prizes worth $10k+ be completed by the end of 2024?
17% chance
Can an LLM beat me in chess?
10% chance