Will any open-source model achieve GPT-4 level performance on MMLU through 2024?
Resolved YES on Dec 9

GPT-4 currently leads the Massive Multitask Language Understanding (MMLU) benchmark [1] with a score of 86.4% [2]. Will any open-source language model achieve an average MMLU score of at least 86.4% by the end of 2024?

A leaderboard of open-source models can be found here.

