Will Reflection Llama 3.1 70B be proven to beat Llama 3.1 405B Instruct in GPQA by the end of September 2024?

Question

Matt Shumer announced the "world’s top open-source model" on Twitter nearly three days ago and AI Twitter has been going off ever since.

This question specifically uses GPQA as it is the only non-saturated eval in Shumer's original post. Acceptable evidence will be either Hugging Face's Open LLM Leaderboard v2 or other highly credible sources for evaluation data.

Manifold Markets · Accepted Answer

No — resolved on Sep 30, 2024 by Manifold Markets prediction market.

#	Trader	Total profit
1		Ṁ91
2		Ṁ8
3		Ṁ4
4		Ṁ2
5		Ṁ1

🏅 Top traders

People are also trading

Related questions