
Mistral Large 2 outperforms Llama 3.1 405b Instruct on Chatbot Arena on August 12th?
Resolved NO on Aug 13.
Mistral Large 2 reportedly outperforms Llama 3.1 405B Instruct on Arena Hard as well as MT Bench.


This question is managed and resolved by Manifold.
🏅 Top traders
| # | Total profit |
|---|---|
| 1 | Ṁ6,701 |
| 2 | Ṁ1,026 |
| 3 | Ṁ681 |
| 4 | Ṁ613 |
| 5 | Ṁ459 |
People are also trading
Which LLM will have the highest ELO at the end of 2025 on ChatBot Arena?
any model on the Chatbot Arena LLM Leaderboard at https://lmarena.ai/ reaches an Arena Score of at least 1520.0 (≥1520)
28% chance
DeepSeek v3.2 vs Mistral Large 3: Which one gets a better LMArena Elo? (without style-control)
Llama 5 outperforms GPT 4o on LM Arena?
85% chance
First model 1500+ in Chatbot Arena?
5/31/26
Mistral Large 2 is now on the leaderboard with 1248, 15 points short of 405B
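For a rough sense of what a 15-point gap means, here is a minimal sketch that assumes Arena scores behave like standard Elo ratings (base 10, scale 400). The function name `expected_win_rate` is made up for illustration, and the leaderboard's actual fitting procedure may differ in detail.

```python
def expected_win_rate(rating_gap: float, scale: float = 400.0) -> float:
    """Expected head-to-head win rate of the higher-rated model.

    Assumes the standard Elo logistic curve (base 10, scale 400).
    """
    return 1.0 / (1.0 + 10.0 ** (-rating_gap / scale))


# 1248 for Mistral Large 2, 15 points short of 405B (so 1263 by the comment above).
gap = 1263 - 1248
print(f"Expected win rate for 405B at a {gap}-point gap: {expected_win_rate(gap):.3f}")
```

Under that assumption, 15 points works out to roughly a 52% expected win rate for 405B in direct matchups, so the gap is real but small.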
nope! haven't been following it but surprised by how low it is.
It catches up quite a bit on math and coding, but still doesn't pass.
Does this measure the difference at a single point, or once per day, or continuously? I.e. "ever outperform before" vs "outperform on"
I think the right choice for markets like this is "outperform on", because otherwise they'll move up and down with noise, and that biases the market towards YES compared with what you actually want to ask, which is "is it better".
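The noise argument can be illustrated with a small simulation. This is only a sketch under made-up assumptions: the true gap is fixed at -15 points, each day's observed gap gets independent Gaussian noise with a standard deviation of 10 points, and the window is 14 days. None of these numbers come from the leaderboard, and real day-to-day scores are correlated rather than independent.

```python
import random


def simulate(true_gap=-15.0, noise_sd=10.0, days=14, trials=20000, seed=0):
    """Compare "outperform on the final day" with "ever outperform in the window".

    All parameters are hypothetical; daily noise is assumed independent.
    Returns (share of trials ahead on the final day, share ahead on any day).
    """
    rng = random.Random(seed)
    on_day = 0
    ever = 0
    for _ in range(trials):
        # Observed gap each day = true gap + that day's noise.
        gaps = [true_gap + rng.gauss(0.0, noise_sd) for _ in range(days)]
        if gaps[-1] > 0:
            on_day += 1
        if max(gaps) > 0:
            ever += 1
    return on_day / trials, ever / trials


p_on, p_ever = simulate()
print(f"P(outperform on final day) = {p_on:.3f}")
print(f"P(ever outperform in window) = {p_ever:.3f}")
```

Because the final day is one of the days in the window, the "ever" share can never be below the "on" share, and with several noisy days it comes out much higher even though the weaker model never actually gets better.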