
Will an AI agent system be able to score at least 40% on level 3 tasks in the GAIA benchmark before 2025.
22
1kṀ4135resolved Jan 1
Resolved
YES1H
6H
1D
1W
1M
ALL
GAIA: a benchmark for General AI Assistants was introduced in Nov. 2023. It contains task for AI agents that test there ability.
An example task is:
"Assuming scientists in the famous youtube video The Thinking Machine (Artificial Intelligence in the 1960s) were interviewed the same year, what is the name of the scientist predicting the sooner thinking machines or robots? Answer using the format First name Last name"
Currently the strongest AI system like GPT4 with plugins or AutoGPT failed to solve any of the level 3 task. This market will resolve to "yes" as soon as an AI/Agent system scores >=40% on level 3 tasks.
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ437 | |
2 | Ṁ333 | |
3 | Ṁ46 | |
4 | Ṁ43 | |
5 | Ṁ25 |
People are also trading
Related questions
Will an AI model surpasses o3's matharena.ai 88% Overall score by July 1, 2025?
7% chance
Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?
60% chance
Will an AI system beat humans in the GAIA benchmark before the end of 2025?
68% chance
What will AI score on TheAgentCompany benchmark in early 2026?
50% chance
By when will AIs perform at least as well as humans on GAIA?
Will an AI score over 80% on FrontierMath Benchmark in 2025
10% chance
Will any AI model score above 95% on GRAB by the end of 2025?
40% chance
Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?
10% chance
Will an autonomous agent resolve 90% of tasks on SWE-bench by 2026?
50% chance
Will any AI solve more than four of AI 2027 Marcus-Brundage tasks in 2025?
28% chance