Will Reflection Llama 3.1 70B be proven to beat Llama 3.1 405B Instruct in GPQA by the end of September 2024?
Ṁ1198 · resolved Sep 30
Resolved NO
Matt Shumer announced the "world’s top open-source model" on Twitter nearly three days ago, and AI Twitter has been going off ever since.
This question specifically uses GPQA as it is the only non-saturated eval in Shumer's original post. Acceptable evidence will be either Hugging Face's Open LLM Leaderboard v2 or other highly credible sources for evaluation data.
This question is managed and resolved by Manifold.
Related questions
Will any LLM outrank GPT-4 by 150 Elo in LMSYS chatbot arena before 2025?
19% chance
Will an open-source LLM beat or match GPT-4 by the end of 2024?
85% chance
Will Meta release a Llama 3 405B multi-modal open source before the end of 2024?
8% chance
Will any foundation model score 75 or higher on The Foundation Model Transparency Index by the end of 2024? (LLama2: 54)
95% chance