What will AI score on TheAgentCompany benchmark in early 2026? | Manifold

What will AI score on TheAgentCompany benchmark in early 2026?


Mar 31

44% chance


The Agent Company is a benchmark for measuring progress on automated remote workers that's been getting a lot of press, most of it mocking how poorly AI performed. That's the point of this market: if you think this research suggests AI is "not coming for your job anytime soon", then bet this down.

The benchmark involves completing contrived tasks meant to simulate running a company. The best score so far is Claude at 24% (I'm guessing OpenAI's o3 will do better).

This market resolves-to-PROB at whatever score the best AI achieves by market close. If the benchmark is saturated, we'll resolve early to 100% (YES). Note that this market can't resolve NO but it can theoretically resolve as low as 24%.
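To make the resolution rule concrete, here is a minimal sketch of it in Python. The function names, the `saturated` flag (a judgment call by the creator, not something computed), and the payout helper are all illustrative assumptions rather than anything Manifold exposes; the payout helper assumes the usual convention that a YES share pays out the resolution probability and a NO share pays out the remainder.

```python
def resolution_probability(best_score_pct, saturated=False, floor_pct=24.0):
    """Return the PROB resolution value in [0, 1] (hypothetical helper).

    best_score_pct: best benchmark score reported by market close, in percent.
    saturated:      True if the creator judges the benchmark saturated.
    floor_pct:      the best score already on record (24% per the description).
    """
    if saturated:
        # A saturated benchmark resolves early to 100% (YES).
        return 1.0
    # The best score so far can only go up, so the market cannot resolve
    # below the score already on record.
    score = max(best_score_pct, floor_pct)
    return min(score, 100.0) / 100.0


def payout_per_share(resolution, side):
    """Mana paid per share, assuming YES pays p and NO pays 1 - p."""
    if side == "YES":
        return resolution
    if side == "NO":
        return 1.0 - resolution
    raise ValueError("side must be 'YES' or 'NO'")


if __name__ == "__main__":
    p = resolution_probability(44)
    print(p, payout_per_share(p, "YES"), payout_per_share(p, "NO"))
```

Under these assumptions, a best score of 44% would pay each YES share 0.44 and each NO share the remaining 0.56, which is why the market can never resolve fully NO.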

Update 2026-05-21 (PST) (AI summary of creator comment): The creator is considering resolving to 44% as a best guess, since the benchmark does not appear to be actively maintained with new results. The creator is open to arguments for a different value before resolving.

Market context

Technical AI Timelines


People are also trading

What will happen in 2026 related to AI?

Chatbot Arena: How high will AI score in 2026?

Will any AI model score above 95% on ARC-AGI-2 by end of 2026?

Will any AI achieve a score of 25% on ARC-AGI-3 by the end of 2026?

In what year will AI achieve a score of 85% or higher on the SimpleBench leaderboard?

The AI company with the smartest AI system by the end of 2026

In what year will AI achieve a score of 95% or higher on the PutnamBench leaderboard?

Will I automate Vanguard rebalancing with an AI agent by 2026?

Major company suffers serious damage from AI agent in 2026?

In what year will AI achieve a score of 95% or higher on the PhysBench leaderboard?


They're not keeping it up-to-date it seems. Is it that the methodology is so dated it's not worth new experiments? Any idea?

@Popsicle2338 Any objections to resolving to 44% as our best guess?

@traders Going once...

@dreev practical solution but wondering whether you’d endorse 44% as your best guess? (Not that it matters for the resolution, just curious.)

@Popsicle2338 Well, I don't have a better guess. I can be easily swayed by arguments if anyone has any.
