GPT-5.2's METR 50% time horizon | Manifold

GPT-5.2's METR 50% time horizon

9

1kṀ1530

resolved Dec 13

ResolvedN/A

2%

<2h

4%

2h - 2.5h

27%

2.5h - 3h

34%

3h - 3.5h

15%

3.5h - 4h

7%

4h - 4.5h

3%

4.5h - 5h

3%

5h - 5.5h

2%

5.5h - 6h

3%

>6h

This market will resolve to the 50% time horizon, as reported by METR, for GPT-5.2.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

Left bounds inclusive, right bounds exclusive.

See also:

/jim/gpt-52-metr

/jim/claude-45-opuss-metr50-horizon (jim's version)

/Bayesian/claude-opus-45s-metr50-time-horizon (my version)

/Bayesian/gemini-3s-50-time-horizon-per-metr

/Bayesian/grok-5s-50-time-horizon-per-metr

/Bayesian/r2s-50-time-horizon-per-metr

Update 2025-12-12 (PST) (AI summary of creator comment): Only GPT-5.2 itself will count for resolution. Variants like GPT-5.2 Codex will not count, even if they are the only version evaluated by METR.

Technical AI Timelines

Get

1,000

to start trading!

People are also trading

Will GPT-5.1 have a longer METR time horizon than Gemini 3?

Will GPT-5.2 surpass 3 hours in METR time horizon?

Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?

Claude Opus 4.5's METR-50 time horizon

Will GPT-5.2's METR 50% time horizon exceed 3 hours 30 minutes?

R2's METR 50% time horizon

Will a model achieve a METR 50% time-horizon of 4+ hours by the end of 2025?

Grok 5's METR 50% time horizon

Gemini 3's METR 50% time horizon

Opus 4.5's METR time horizon beats Gemini 3.0 Pro's?

Sort by:

https://manifold.markets/jim/gpt-52-metr?r=amlt

bought Ṁ10 YES

@jim frick sorry i n/a

Must this be 5.2 or will 5.2 Codex count if Codex is the only one evaluated by METR?

@JaySocrates good question, only 5.2

People are also trading

Will GPT-5.1 have a longer METR time horizon than Gemini 3?

-21% 1d32% chance

Will GPT-5.2 surpass 3 hours in METR time horizon?

+8% 1d80% chance

Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?

+15% 1d87% chance

Claude Opus 4.5's METR-50 time horizon

Will GPT-5.2's METR 50% time horizon exceed 3 hours 30 minutes?

-24% 1d24% chance

R2's METR 50% time horizon

Will a model achieve a METR 50% time-horizon of 4+ hours by the end of 2025?

Grok 5's METR 50% time horizon

Gemini 3's METR 50% time horizon

Opus 4.5's METR time horizon beats Gemini 3.0 Pro's?

Related questions

Will GPT-5.1 have a longer METR time horizon than Gemini 3?

Will GPT-5.2 surpass 3 hours in METR time horizon?

Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?

Claude Opus 4.5's METR-50 time horizon

Will GPT-5.2's METR 50% time horizon exceed 3 hours 30 minutes?

R2's METR 50% time horizon

Will a model achieve a METR 50% time-horizon of 4+ hours by the end of 2025?

Grok 5's METR 50% time horizon

Gemini 3's METR 50% time horizon

Opus 4.5's METR time horizon beats Gemini 3.0 Pro's?

© Manifold Markets, Inc.•Terms•Privacy