Will an AI system beat humans in the GAIA benchmark in 2024?
Resolved NO on Jan 4.
GAIA: a benchmark for General AI Assistants
paper: https://arxiv.org/abs/2311.12983
GPT-4 scored only 15%, compared to 92% for humans.
I will resolve based on open source results or official reputable company announcements/papers.
This question is managed and resolved by Manifold.
🏅 Top traders
| # | Total profit |
|---|---|
| 1 | Ṁ33 |
| 2 | Ṁ17 |
| 3 | Ṁ16 |
| 4 | Ṁ11 |
| 5 | Ṁ5 |
@JanProvaznik Resolves as NO. AI assistants still score much worse than humans.

@mods Resolves as NO. The market creator has been inactive for two months. The resolution should be very clear: GAIA's leaderboard shows AI assistants are still significantly behind humans.