
Will Reflection Llama 3.1 70B be proven to beat Llama 3.1 405B Instruct in GPQA by the end of September 2024?
10
100Ṁ1198resolved Sep 30
Resolved
NO1H
6H
1D
1W
1M
ALL
Matt Shumer announced the "world’s top open-source model" on Twitter nearly three days ago and AI Twitter has been going off ever since.
This question specifically uses GPQA as it is the only non-saturated eval in Shumer's original post. Acceptable evidence will be either Hugging Face's Open LLM Leaderboard v2 or other highly credible sources for evaluation data.
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ91 | |
2 | Ṁ8 | |
3 | Ṁ4 | |
4 | Ṁ2 | |
5 | Ṁ1 |