Best METR 50% Time Horizon in 2026

MANIFOLD

Ṁ2.8kṀ19k

Dec 31

98.8%

>16h

98.7%

>18h

98%

>19h

97%

>20h

96%

>21h

94%

>22h

92%

>24h

91%

>26h

89%

>28h

85%

>30h

82%

>32h

76%

>36h

73%

>38h

66%

>40h

61%

>45h

57%

>50h

50%

>55h

44%

>60h

39%

>70h

32%

>80h

See https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/

Resolves to the longest 50% Time Horizon, as measured by METR, for any AI system, by the end of 2026. Answers that are passed early can be resolved early.

IMPORTANT: Resolves to all thresholds exceeded, not just to the highest one that applies. eg for a 11 hour time horizon, ">10h" resolves yes, but so does >6h and >8h

Market context

Technology

Technical AI Timelines

OpenAI

AI Impacts

Get

1,000

to start trading!

People are also trading

Best AI time horizon by August 2026, per METR?

GPT 5.4 METR 50% time horizon

Grok 4.20 METR 50% time horizon

R2 / V4-Thinking METR 50% time horizon

What will be the METR time horizon doubling time in 2026?

GPT 5.2 Pro METR time horizon

Claude Sonnet 4.6 METR 50% time horizon

Will METR retire the 50% Time Horizon by EOY 2026

46% chance

What will the frontier METR time horizon be on January 1, 2027?

Will the METR 50% Time Horizon be "ambiguous" at the end of 2026?

Sort by:

I don’t know if this can be fixed but 12 hours should no longer be resolved. METR updated their graph and Claude Opus 4.6 is at 11 hours 59 minutes.

@MaxLennartson hmmm i'll let previous resolutions be binding even if they change the methodology throughout the year

@Bayesian '>14h' can resolve as 'yes' because of the Opus 4.6 measurement (14 hours and 30 minutes). Thanks!

The members of the AI futures project have given an update and they appear to now be relying on the 80% time horizon length graph from METR for their predictions rather than the 50% time horizon length graph (correction: they have always used the 80% time horizon length). This implies that a 50% time horizon is not enough. While I think markets for 50% time horizons are useful, I now think that more attention needs to be paid to 80% time horizon lengths.

@MaxLennartson Source: https://www.aifuturesmodel.com/#section-timehorizonandtheautomatedcodermilestone

@MaxLennartson Well, now the 50% time horizon measure is saturated.

@Haiku Sorry I am confused about your comment?

@MaxLennartson You had said that markets for the 50% time horizon were useful. But with the release of the 50% time horizon for Claude 4.6 Opus, METR said that it's hard to measure now because the benchmark is saturated. Basically, we don't actually know what the time horizon of Claude 4.6 Opus is. They're continuing to work on updating the time horizon benchmark, but the new version might be saturated at 50% by the time it gets released. I expect them to retire the 50% benchmark and add a 95% benchmark.

@Haiku I think the 50% time horizon graph is still useful to varying degrees but METR will probably retire the graph at some point. We do have a time horizon for Claude Opus 4.6. I think the 80% time horizon graph is the most useful especially for timelines. I doubt that METR will create a 95% or even a 100% time horizon graph because they don’t have tasks that fit the criteria.

opened a Ṁ10,000 YES at 59% order

@bens

@Bayesian hmmmm

@Bayesian might depend on the methodology announced by METR for those longer tasks

bought Ṁ1,500 NO

@Bayesian but I’ll take some now just for kicks

bought Ṁ30 NO

I very roughly polled METR staff (using Fatebook) what the 50% time horizon will be by EOY 2026, conditional on METR reporting something analogous to today's time horizon metric.
I got the following results: 29% average probability that it will surpass 32 hours. 68% average probability that it will surpass 16 hours.
The first question got 10 respondents and the second question got 12. Around half of the respondents were technical researchers. I expect the sample to be close to representative, but maybe a bit more short-timelines than the rest of METR staff.
The average probability that the question doesn't resolve AMBIGUOUS is somewhere around 60%.

opened a Ṁ50 NO at 65% order

@Bayesian am i misinformed or outdated now? If the doubling period was 7 months or whatever these estimations seem quite optimistic on an increase in the doubling speed!

@No_uh the doubling times havent been 7months, closer to 4.5-5 months

@Bayesian were they never 7 months? am i just misremembering? or did they seem to shift not too long ago?

@No_uh oh yeah they were 7 months, and for a while they were consistent eith 4 month to 7month range, and that uncertainty is slightly narrowing over time

@Bayesian Yes, that makes sense. it looks like I just am not up to date on the narrowing. I'm only human lmao, seems I already cannot keep up. Welp, enjoy my free mana everyone ;)

exit: and thank you as always Bayesian for responding!

@Bayesian I am personally going with a five month doubling time.

People are also trading

Best AI time horizon by August 2026, per METR?

GPT 5.4 METR 50% time horizon

Grok 4.20 METR 50% time horizon

R2 / V4-Thinking METR 50% time horizon

What will be the METR time horizon doubling time in 2026?

GPT 5.2 Pro METR time horizon

Claude Sonnet 4.6 METR 50% time horizon

Will METR retire the 50% Time Horizon by EOY 2026

46% chance

What will the frontier METR time horizon be on January 1, 2027?

Will the METR 50% Time Horizon be "ambiguous" at the end of 2026?

63% chance

People are also trading

People are also trading

Related questions