Which AI model will win the Kaggle Game Arena Chess tournament?
36
1.1k Ṁ12k Aug 10
0.3% o4-mini
0.2% DeepSeek-R1
0.4% Gemini 2.5 Pro
0.2% Claude Opus 4
69% Grok 4
0.2% Gemini 2.5 Flash
29% o3
0.2% Kimi K2
0.3%
This question is managed and resolved by Manifold.
People are also trading
What is the next major competitive sport where AI beat top human player
Will a large language model beat a super grandmaster playing chess by 2028?
60% chance
Will an AI by OpenAI beat a super grandmaster playing chess by 2028?
57% chance
Will OpenAI model win first "inter-AI-model diplomacy" game where the game is EU4/5, Civ6/7, or AoE2 Regicide Rumble?
50% chance
Which of these Language Models will beat me at chess?
Which chess engine will be the strongest at the end of 2031?
Which company has best Search AI model end of 2025? (Search Arena Leaderboard)
Which chess engine will be the strongest at the end of 2028?
Which AI model will win Kaggle‘s chess tournament?
Which of these language models will I beat at chess?
They should've done it in PGN format, since that greatly increases the legal move rate (it was around 99.9% with GPT-3.5 Turbo Instruct), with a separate channel for commentary, instead of allowing 4 attempts with feedback when a move is illegal.
Not sure about the exact scaffold they're using, but it's clearly leading to low-quality chess, given that a model from over 2 years ago played at around 1800.
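For anyone unfamiliar with the PGN idea, here is a minimal sketch of what such a prompt could look like. It only formats a list of SAN moves as PGN movetext and stops where the next move belongs, so a completion-style model continues the game record. The function name `pgn_prompt` and the header tags are assumptions made for illustration; this is not the scaffold the tournament uses, and it does not check move legality.

```python
def pgn_prompt(san_moves, white="White", black="Black"):
    """Build a PGN-style prompt that ends where the next move should go."""
    # Header tags are illustrative placeholders, not the tournament's setup.
    header = f'[White "{white}"]\n[Black "{black}"]\n[Result "*"]\n\n'

    parts = []
    for i, move in enumerate(san_moves):
        if i % 2 == 0:
            # White's moves are preceded by the move number, e.g. "1."
            parts.append(f"{i // 2 + 1}.")
        parts.append(move)

    # If White is to move, end with the next move number so the model
    # is nudged to complete a White move; otherwise end after White's move.
    if len(san_moves) % 2 == 0:
        parts.append(f"{len(san_moves) // 2 + 1}.")

    return header + " ".join(parts)


if __name__ == "__main__":
    print(pgn_prompt(["e4", "e5", "Nf3"]))
```

Any commentary would be requested in a separate call or channel, so the move channel only ever contains game notation.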