Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?

Question

The 50% time horizon is a measure of AI autonomy based on the length of tasks an AI system can complete: roughly, it is the time humans take to complete tasks that the AI system succeeds at 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% time horizon of 59 minutes.
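To make the rough definition above concrete, here is a minimal sketch of one way such a horizon could be estimated: fit a logistic curve of success against the base-2 logarithm of human completion time, then read off the task length where the fitted success probability crosses 50%. This is an illustration under that assumption, not METR's actual pipeline, which is documented in their paper. The run data and the function names (`fit_logistic`, `time_horizon`) are invented for the example.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(xs, ys, lr=0.1, steps=5000):
    """Fit P(success) = sigmoid(a + b*x) by gradient ascent on the mean log-likelihood."""
    a, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        grad_a = sum(y - sigmoid(a + b * x) for x, y in zip(xs, ys)) / n
        grad_b = sum((y - sigmoid(a + b * x)) * x for x, y in zip(xs, ys)) / n
        a += lr * grad_a
        b += lr * grad_b
    return a, b

def time_horizon(human_minutes, successes, p=0.5):
    """Human task length (minutes) at which the fitted success probability equals p."""
    logs = [math.log2(t) for t in human_minutes]
    center = sum(logs) / len(logs)  # centering keeps gradient ascent well-conditioned
    a, b = fit_logistic([x - center for x in logs], successes)
    logit_p = math.log(p / (1.0 - p))  # equals 0 when p = 0.5
    return 2.0 ** (center + (logit_p - a) / b)

# Hypothetical runs: (human completion time in minutes, 1 = AI succeeded, 0 = AI failed).
# These numbers are made up for illustration and are not METR data.
toy_runs = [
    (4, 1), (4, 1), (8, 1), (8, 1),
    (16, 1), (16, 0), (32, 1), (32, 0), (64, 1), (64, 0),
    (128, 0), (128, 0), (256, 0), (256, 0),
]
minutes = [m for m, _ in toy_runs]
outcomes = [s for _, s in toy_runs]

print(f"50% time horizon: {time_horizon(minutes, outcomes):.1f} minutes")
print(f"80% time horizon: {time_horizon(minutes, outcomes, p=0.8):.1f} minutes")
```

The toy outcomes are symmetric around 32-minute tasks, so the fitted 50% horizon comes out at about 32 minutes. Asking for a higher success threshold such as 80% gives a shorter horizon, since the model is only that reliable on shorter tasks.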

See also:

/Bayesian/gemini-3s-50-time-horizon-per-metr

Manifold Markets · Accepted Answer

Yes — resolved on Feb 3, 2026 by Manifold Markets prediction market.

🏅 Top traders

#	Trader	Total profit
1		Ṁ1,425
2		Ṁ881
3		Ṁ680
4		Ṁ631
5		Ṁ310
