Will an AI system beat humans in the GAIA benchmark in 2024?
Resolved NO on Jan 4

GAIA: a benchmark for General AI Assistants

paper: https://arxiv.org/abs/2311.12983

GPT-4 scored only 15%, compared to 92% for humans.

I will resolve based on open-source results or official announcements/papers from reputable companies.


@JanProvaznik Resolves as NO. AI Assistants still score much worse than humans:

https://huggingface.co/spaces/gaia-benchmark/leaderboard

@mods Resolves as NO. The market creator has been inactive for two months. The resolution should be very clear: GAIA's leaderboard shows AI assistants are still significantly behind humans.

@ChaosIsALadder resolving NO per above.
