Claude Sonnet 4.6 METR 50% time horizon
18
1.2kṀ5643
Mar 2
0.6%
< 2h
0.9%
2h - 2h30
1.1%
2h30 - 3h
6%
3h - 3h30
7%
3h30 - 4h
8%
4h - 4h30
13%
4h30 - 5h
16%
5h - 5h30
16%
5h30 - 6h
11%
6h - 6h30
20%
Other

This market will resolve to the highest 50% time horizon, as reported by METR, for the first Claude Sonnet 4.6 thinking model to appear on METR's graph. Claude Sonnet 4.7 counts for the purpose of this market, if 4.6 is skipped. So does 4.75, but 5 would not count.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

Left bounds inclusive, right bounds exclusive.

See also:

/jim/gpt-52-metr

/jim/claude-45-opuss-metr50-horizon (jim's original market)

/Bayesian/claude-opus-45s-metr50-time-horizon (my version)

/Bayesian/gemini-3s-50-time-horizon-per-metr

/Bayesian/grok-420s-metr-50-time-horizon

/Bayesian/claude-sonnet-46s-metr-50-time-hori (this market)

/Bayesian/grok-5s-50-time-horizon-per-metr

/Bayesian/r2s-50-time-horizon-per-metr

/Bayesian/kimi-k3-thinkings-metr-50-time-hori

  • Update 2025-12-22 (PST) (AI summary of creator comment): If Claude Sonnet 4.5 (new) is released instead of 4.6, this market will resolve N/A.

Market context
Get
Ṁ1,000
to start trading!
Sort by:

The members of the AI futures project have given an update and they appear to now be relying on the 80% time horizon length graph from METR for their predictions rather than the 50% time horizon length graph. This implies that a 50% time horizon is not enough. While I think markets for 50% time horizons are useful, I now think that more attention needs to be paid to 80% time horizon lengths. I am planning to create markets for 80% time horizons as soon as possible unless someone beats me to it.

Would Claude sonnet 4.5 (new) count lol

@satchlj 😭 claude 4.5 new resolves N/A 😤

© Manifold Markets, Inc.TermsPrivacy