Will a LLM beat human experts on GPQA by Jan 1, 2025?
30
205
815
2025
49%
chance

GQPA dataset here: https://arxiv.org/abs/2311.12022

"Human expert" means 74%.

Currently, GPT-4 gets 39%.

The LLM is allowed to use external tools (e.g. Google, Wolfram Alpha).

Get Ṁ200 play money

More related questions