
Mistral Large 2 outperforms Llama 3.1 405b Instruct on Chatbot Arena on August 12th?
50
10kṀ79kresolved Aug 13
Resolved
NO1H
6H
1D
1W
1M
ALL
Mistral Large 2 reportedly outperforms on Arena Hard as well as MT Bench:


This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ6,701 | |
2 | Ṁ1,026 | |
3 | Ṁ681 | |
4 | Ṁ613 | |
5 | Ṁ459 |
People are also trading
Sort by:
Mistral Large 2 is now on the leaderboard with 1248, 15 points short of 405B
nope! haven't been following it but surprised by how low it is.
It catches up quite a bit on math, coding but still doesn't pass
Does this measure the difference at a single point, or or once per day, continuously? I.e. "ever outperform before" vs "outperform on"
I think the right choice for markets like this is "outperform on", because otherwise they'll move up and down with noise and that biases the market towards yes vs what you want to ask which is 'is it better'