Will an AI system beat humans in the GAIA benchmark in 2024?
12
Ṁ190Ṁ809resolved Jan 4
Resolved
NO1H
6H
1D
1W
1M
ALL
GAIA: a benchmark for General AI Assistants
paper: https://arxiv.org/abs/2311.12983
GPT4 scored only 15% compared to 92% for humans
I will resolve based on open source results or official reputable company announcements/papers.
This question is managed and resolved by Manifold.
Market context
Get
1,000 to start trading!
🏅 Top traders
| # | Trader | Total profit |
|---|---|---|
| 1 | Ṁ33 | |
| 2 | Ṁ17 | |
| 3 | Ṁ16 | |
| 4 | Ṁ11 | |
| 5 | Ṁ5 |
Sort by:
@JanProvaznik Resolves as NO. AI Assistants still score much worse than humans:

@mods Resolves as NO. The market creator has been inactive for two months. The resolution should be very clear: GAIA's leaderboard shows AI assistants are still significantly behind humans.
People are also trading
Related questions
By when will AIs perform at least as well as humans on GAIA?
Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?
35% chance
Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?
48% chance
Will AIs beat human experts in question-answering on the GPQA benchmark before January 1st, 2027?
95% chance
Will AI beat top Magic the Gathering human player before the end of 2026?
18% chance
Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?
60% chance
Will AI beat top Magic the Gathering human player before the end of 2028?
30% chance
Will an AI system similar to Auto-GPT make a successful attempt to kill a human by 2030?
27% chance
Will an AI by OpenAI beat a super grandmaster playing chess by 2028?
57% chance
In what year will AI achieve a score of 95% or higher on the PhysBench leaderboard?
2036