Will a LLM beat human experts on GPQA by Jan 1, 2025?
57
1kแน€42k
resolved Dec 20
Resolved
YES

GQPA dataset here: https://arxiv.org/abs/2311.12022

"Human expert" means 74%.

Currently, GPT-4 gets 39%.

The LLM is allowed to use external tools (e.g. Google, Wolfram Alpha).

Get
แน€1,000
to start trading!

๐Ÿ… Top traders

#NameTotal profit
1แน€1,198
2แน€880
3แน€731
4แน€621
5แน€605
ยฉ Manifold Markets, Inc.โ€ขTermsโ€ขPrivacy