Will an AI agent system be able to score at least 40% on level 3 tasks in the GAIA benchmark before 2025?

22

1kṀ4135

Resolved YES on Jan 1

GAIA, a benchmark for General AI Assistants, was introduced in Nov. 2023. It contains tasks for AI agents that test their abilities.

An example task is:
"Assuming scientists in the famous youtube video The Thinking Machine (Artificial Intelligence in the 1960s) were interviewed the same year, what is the name of the scientist predicting the sooner thinking machines or robots? Answer using the format First name Last name"

Currently, the strongest AI systems, such as GPT-4 with plugins or AutoGPT, fail to solve any of the level 3 tasks. This market will resolve to "yes" as soon as an AI/agent system scores >=40% on level 3 tasks.
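
The resolution rule above is a simple threshold check, which can be sketched as follows. This is only an illustration: the function names `level3_score` and `resolves_yes` are hypothetical, and it assumes the score is the plain percentage of level 3 tasks answered correctly, which may differ from how the official leaderboard aggregates results.

```python
def level3_score(results):
    """Return the percentage of level 3 tasks answered correctly.

    `results` is a list of booleans, one per level 3 task
    (an assumed input format, not the leaderboard's own).
    """
    if not results:
        raise ValueError("no level 3 results given")
    return 100.0 * sum(results) / len(results)


def resolves_yes(score_percent, threshold=40.0):
    """Apply the market's criterion: score >= 40% on level 3 tasks."""
    return score_percent >= threshold


# Example with made-up results: 2 of 5 tasks solved is exactly 40%.
example = [True, True, False, False, False]
print(resolves_yes(level3_score(example)))
```

Note that the criterion is inclusive (>=), so a score of exactly 40% would count.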

Technical AI Timelines

🏅 Top traders

#	Name	Total profit
1		Ṁ437
2		Ṁ333
3		Ṁ46
4		Ṁ43
5		Ṁ25

Currently the highest-scoring agent scores 18.75% on level 3. So we are about halfway there with a couple of months to go...

predicted YES

Important new submission. It is not yet 100% clear whether it is legit or not.
But "Friday" claims to already reach 45% on level 1 and 6% on level 3:
https://twitter.com/mialon_gregoire/status/1750110058090782906

People are also trading

Will an AI score over 80% on FrontierMath Benchmark in 2025

Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?

Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?

Will an AI achieve a perfect score on the Miklós Schweitzer Competition before 2035?

Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?

Will an AI system beat humans in the GAIA benchmark before the end of 2025?

What will AI score on TheAgentCompany benchmark in early 2026?

By when will AIs perform at least as well as humans on GAIA?

Will any AI model score above 95% on GRAB by the end of 2025?

Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?
