Grok 3 VS Claude Sonnet 3.7: Who wins on the Chatbot model arena?
10
100Ṁ1218resolved Feb 28
1D
1W
1M
ALL
100%99.0%
Grok 3
1.0%
Claude Sonnet 3.7
"Claude sonnet 3.7" will consider whichever version of claude sonnet 3.7 is highest on https://lmarena.ai when claude sonnet 3.7 gets first placed in the arena leaderboard. for example, if there are two separate models placed on the lb (sonnet 3.7 and sonnet 3.7 thinking), whichever one places higher will be considered for this market's resolution.
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ125 | |
2 | Ṁ122 | |
3 | Ṁ43 | |
4 | Ṁ16 | |
5 | Ṁ4 |
Sort by:
it's out!
https://lmarena.ai/?leaderboard
3.7 sonnet places 11th, 4th with styleCTL.. looks like grok 3 wins comfortably
People are also trading
Related questions
Will Claude 3.5 Opus have a higher Chat Arena Elo than GPT-5?
6% chance
Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
9% chance
Will Grok 4 Top the Chatbot Leaderboard?
51% chance
Will Claude Opus be ranked in the top 20 on the Chatbot Arena Leaderboard two years from today (3/10/24)?
11% chance
What will be GPT-5's score on LMSYS Chatbot Arena?
In 2028, will I use a chatbot that can win >25% of Turing test games (defined within) where I am the judge?
35% chance