Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?
104
1k Ṁ16k
Nov 30
78%
chance

The 50% time horizon is a measure of AI autonomy based on the length of tasks an AI system can do: roughly, it is the time humans take to complete tasks that the AI system completes successfully 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.
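For intuition, here is a minimal sketch of how a 50% time horizon could be estimated: fit a logistic curve of success against the log of human completion time, then solve for the task length where predicted success is 50%. The task data, function names, and fitting settings below are made up for illustration and are not METR's data or code; see the paper for the actual methodology.

```python
import math

def fit_logistic(xs, ys, lr=0.1, steps=5000):
    """Fit p(success) = sigmoid(a + b * x) by gradient ascent on the log-likelihood."""
    a, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(a + b * x)))
            grad_a += y - p
            grad_b += (y - p) * x
        a += lr * grad_a / n
        b += lr * grad_b / n
    return a, b

def time_horizon_50(human_minutes, successes):
    """Return the task length (in minutes) at which fitted success probability is 50%."""
    logs = [math.log2(m) for m in human_minutes]
    mean = sum(logs) / len(logs)
    centered = [x - mean for x in logs]  # centering helps the fit converge
    a, b = fit_logistic(centered, successes)
    # sigmoid(a + b * x) = 0.5 exactly where a + b * x = 0
    return 2 ** (mean - a / b), b

# Hypothetical results: human time per task (minutes) and whether the model succeeded.
human_minutes = [1, 2, 4, 8, 15, 30, 60, 120, 240, 480]
successes = [1, 1, 1, 1, 0, 1, 0, 1, 0, 0]

horizon, slope = time_horizon_50(human_minutes, successes)
print(f"Estimated 50% time horizon: {horizon:.0f} minutes")
```

The slope should come out negative (longer tasks are less likely to succeed), and the horizon is the point where the fitted curve crosses 50%.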

See also:

/Bayesian/gemini-3s-50-time-horizon-per-metr


Is this intended to be closed?

@AshDorsey No, thanks

Bearish

bought Ṁ250 YES

@CameronHolmes do you want to bet more

@Bayesian erm probably not at this point I'm afraid and especially not if you are offering! 😂

@CameronHolmes 😭

bought Ṁ150 NO

what about at 43%

@Bayesian I'd buy 500 YES at 43% still interested?

im taking orders of 3000 mana or more at 50% today ping me
