Will an open-source LLM on Hugging Face beat an average human at the most common LLM benchmarks by July 1, 2024?
Resolved N/A on Dec 23

Open-source models are evaluated on ARC, HellaSwag, MMLU, and TruthfulQA on the Open LLM Leaderboard (https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard). I've added the following plot to this Hugging Face space so we can see the progress of open-source models over time:

I'm wondering when the average human baseline will be passed. A linear trend fitted to the leaderboard scores crosses the baseline in July 2024, with a Pearson correlation coefficient of 0.89:

But this trend might not be linear. This question will resolve Yes if the average human baseline on the Open LLM Leaderboard on Hugging Face is surpassed before July 1, 2024.
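
For anyone who wants to reproduce the kind of linear extrapolation described above, here is a minimal sketch. It is not the code behind the plot: the function names `fit_line` and `crossing_date` are hypothetical, and the dates, scores, and baseline in the example are made-up placeholders, not leaderboard data. It fits an ordinary least-squares line to (date, best score) pairs, reports the Pearson coefficient, and solves for the date at which the line reaches a given baseline.

```python
from datetime import date, timedelta
from math import sqrt


def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept, pearson_r)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    pearson_r = sxy / sqrt(sxx * syy)
    return slope, intercept, pearson_r


def crossing_date(dates, scores, baseline):
    """Date at which the fitted line reaches `baseline`, plus Pearson r.

    Returns (None, r) when the fitted trend is flat or decreasing.
    """
    origin = min(dates)
    xs = [(d - origin).days for d in dates]  # days since first observation
    slope, intercept, r = fit_line(xs, scores)
    if slope <= 0:
        return None, r
    days_to_cross = (baseline - intercept) / slope
    return origin + timedelta(days=round(days_to_cross)), r


if __name__ == "__main__":
    # Placeholder values for illustration only; NOT real leaderboard scores.
    example_dates = [date(2023, 5, 1), date(2023, 7, 1),
                     date(2023, 9, 1), date(2023, 11, 1)]
    example_scores = [60.0, 66.0, 69.0, 74.0]
    example_baseline = 90.0  # assumed baseline, not taken from the leaderboard

    when, r = crossing_date(example_dates, example_scores, example_baseline)
    print(f"projected crossing: {when}, Pearson r = {r:.2f}")
```

A high Pearson coefficient only says the past points sit close to a line; it says nothing about whether the trend continues, which is why the projected date should be treated as a rough guide.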
