Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?
104
1kแน16kNov 30
78%
chance
1H
6H
1D
1W
1M
ALL
50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.

See also:
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
People are also trading
Will GPT-5.1 have a longer METR time horizon than Gemini 3?
42% chance
Gemini 3.0 scores 1500+ on Chatbot Arena in 2025? (Without Style Control)
21% chance
Before 2026, will Gemini 3.0 exceed GPT-5 in Metr estimated time horizon?
76% chance
Gemini 3's execution time-horizon?
2,028
Will Gemini 3 Pro be better than ChatGPT 5 Thinking in math?
89% chance
Sort by:
boughtแน250YES
People are also trading
Related questions
Will GPT-5.1 have a longer METR time horizon than Gemini 3?
42% chance
Gemini 3.0 scores 1500+ on Chatbot Arena in 2025? (Without Style Control)
21% chance
Before 2026, will Gemini 3.0 exceed GPT-5 in Metr estimated time horizon?
76% chance
Gemini 3's execution time-horizon?
2,028
Will Gemini 3 Pro be better than ChatGPT 5 Thinking in math?
89% chance