MANIFOLD
Will a multi-agent system have its time horizon evaluated by METR before August 2026?
4
Ṁ1kṀ640
Jul 31
29%
chance

METR's time horizon evaluation: https://metr.org/time-horizons/

Some existing multi-agent systems: GPT-5.2 Pro, Grok 4 Heavy, Gemini 3 Deep Think.

This market doesn't count "regular" models being able to spawn subagents. For example, if the reported evaluated model is Claude Opus 4.6, but the evaluation was made within Claude Code where Claude Opus 4.6 could spawn some Claude Sonnet 4.6 subagents, this does not count for the purpose of this market.

Market context
Get
Ṁ1,000
to start trading!
Sort by:

GPT-5.2-Pro is actually available via API right now which I think should simplify the evaluation process quite a lot

© Manifold Markets, Inc.TermsPrivacy