Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?
18
1kṀ2102
Nov 8
60%
chance

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

Get
Ṁ1,000
to start trading!
© Manifold Markets, Inc.TermsPrivacy