Will an LLM beat human experts on GPQA by Jan 1, 2025?
38% chance

GPQA dataset here: https://arxiv.org/abs/2311.12022

"Human expert" means 74% accuracy.

Currently, GPT-4 gets 39%.

The LLM is allowed to use external tools (e.g. Google, Wolfram Alpha).
