Will Reflection Llama 3.1 70B be proven to beat Llama 3.1 405B Instruct in GPQA by the end of September 2024?

100Ṁ1198

resolved Sep 30

Resolved

ALL

Matt Shumer announced the "world’s top open-source model" on Twitter nearly three days ago and AI Twitter has been going off ever since.

This question specifically uses GPQA as it is the only non-saturated eval in Shumer's original post. Acceptable evidence will be either Hugging Face's Open LLM Leaderboard v2 or other highly credible sources for evaluation data.

Technology

Technical AI Timelines

OpenAI

LLMs

Get

1,000

to start trading!

🏅 Top traders

#	Name	Total profit
1		Ṁ91
2		Ṁ4
3		Ṁ2
4		Ṁ1
5		Ṁ1

1 Comment

10 Holders

11 Trades

Sort by:

Reflection accused of being a fraud: https://unrollnow.com/status/1832933747529834747

People are also trading

Will Llama 4 Behemoth top the lmarena?

10% chance

Will lucky llama speak before the end of 2025?

15% chance

Will an open-source LLM under 10B parameters surpass Claude 3.5 Haiku by EOY 2025?

99% chance

Will any foundation model score 75 or higher on The Foundation Model Transparency Index by the end of 2024? (LLama2: 54)

96% chance

🏅 Top traders

People are also trading

Related questions