
Mistral Large 2 outperforms Llama 3.1 405b Instruct on Chatbot Arena on August 12th?
50
10k · Ṁ79k · resolved Aug 13
Resolved NO
Mistral Large 2 reportedly outperforms Llama 3.1 405B Instruct on Arena Hard as well as MT Bench.


This question is managed and resolved by Manifold.
🏅 Top traders
# | Total profit |
---|---|
1 | Ṁ6,701 |
2 | Ṁ1,026 |
3 | Ṁ681 |
4 | Ṁ613 |
5 | Ṁ459 |
Mistral Large 2 is now on the leaderboard with 1248, 15 points short of 405B
nope! haven't been following it but surprised by how low it is.
It catches up quite a bit on math and coding, but still doesn't pass 405B.
Does this measure the difference at a single point, once per day, or continuously? I.e. "ever outperform before" vs "outperform on"
I think the right choice for markets like this is "outperform on". Otherwise they'll move up and down with noise, and that biases the market towards YES compared with what you actually want to ask, which is "is it better?"
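To make the noise argument concrete, here is a rough simulation sketch. Only the 15-point gap comes from the thread; the noise level (`noise_sd`), the 14-day window, and the assumption that each day's observed gap is the true gap plus independent Gaussian noise are all hypothetical choices for illustration, not how the leaderboard actually updates.

```python
import random


def simulate(true_gap=-15.0, noise_sd=10.0, days=14, trials=10_000, seed=0):
    """Return (P(ahead on the final day), P(ahead on at least one day)).

    true_gap: the challenger's score minus the incumbent's (from the thread).
    noise_sd, days: hypothetical values chosen only for illustration.
    """
    rng = random.Random(seed)
    on_final = 0
    ever = 0
    for _ in range(trials):
        # Observed daily gap = fixed true gap + independent measurement noise.
        observed = [true_gap + rng.gauss(0.0, noise_sd) for _ in range(days)]
        if observed[-1] > 0:   # "outperform on" the resolution day
            on_final += 1
        if max(observed) > 0:  # "ever outperform" during the window
            ever += 1
    return on_final / trials, ever / trials


p_on, p_ever = simulate()
print(f"outperform on final day:   {p_on:.3f}")
print(f"ever outperform in window: {p_ever:.3f}")
```

Because the final day is one of the days in the window, the "ever" probability can never be lower than the "on" probability, and with any noise at all it grows as the window gets longer, even though the true gap never changes.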
Related questions
Will Llama 4 be the best LLM in the chatbot arena?
10% chance
Which LLM will have the highest ELO at the end of 2025 on ChatBot Arena?
Llama 3 405B ELO on Lmsys Arena Leaderboard 2 weeks after first appearance?
Will GPT-5 top the LLMSys Chatbot Arena leaderboard within a month of its release?
85% chance
Will a Mamba 7b model trained on 2 trillion tokens outperform Llama2-13B
66% chance
Mistral IPO by EOY?
38% chance
Will Mistral's next model make it to the top 10 models in LLM Arena by the end of 2025?
45% chance
Will Mistral-Large be considered on par or better than GPT-5? (text)
4% chance