Gemini 3 Pro Preview METR 50% time horizon

186

Ṁ1.5kṀ180k

resolved Feb 3

Answer	Probability
3.5h - 4h	100% 15%
<1.5h	0.8%
1.5h - 2h	1.6%
2h - 2.5h	
2.5h - 3h	
3h - 3.5h	
4h - 5h	72%
5h - 6h	1.7%
6h - 7h	0.6%
7h - 8h	0.4%
8h - 9h	0.3%
9h - 10h	0.1%
10h - 11h	0.1%
11h - 12h	0.1%
>=12h	0.1%

This market will resolve to the highest 50% time horizon, as reported by METR, for any Gemini 3 model released within a month of the first Gemini 3 announcement.

50% time horizon is a measure of AI autonomy based on the length of tasks that AI can do: roughly, it is the time that humans take to complete tasks that an AI system can successfully do 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model with a 50% horizon of 59 minutes.
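As a rough illustration of the definition above, the sketch below assumes success probability is modeled as a logistic function of log2 task length (which is, as far as I understand it, the general form METR's paper uses) and solves for the task length at a chosen success rate. The function names and the coefficients in the usage note are hypothetical, not METR's fitted values.

```python
import math


def logit(p: float) -> float:
    """Log-odds of a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))


def horizon_minutes(intercept: float, slope: float, p: float = 0.5) -> float:
    """Task length (in human minutes) at which modeled success equals p.

    Assumed model (illustrative only):
        P(success) = sigmoid(intercept + slope * log2(minutes))
    with slope < 0, so longer tasks are less likely to succeed.
    """
    if slope >= 0:
        raise ValueError("slope must be negative for a finite horizon")
    return 2.0 ** ((logit(p) - intercept) / slope)
```

For example, with a hypothetical slope of -0.6 and an intercept chosen so that the 50% horizon is 59 minutes, `horizon_minutes(intercept, -0.6, p=0.8)` returns a shorter task length than `horizon_minutes(intercept, -0.6)`, which is why an 80% time horizon is always below the 50% one under this model.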

Left bounds inclusive, right bounds exclusive.
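A minimal sketch of how the bound convention plays out, assuming the bucket edges are exactly the ones in the answer list and that the input is the reported horizon in hours. The names `EDGES`, `bucket_for`, and `resolve` are made up for illustration; the N/A case follows the creator's comment that the market resolves N/A if no qualifying model is evaluated.

```python
# Bucket edges in hours, read off the answer list: "<1.5h", half-hour
# steps up to 4h, hourly steps up to 12h, then ">=12h".
EDGES = [1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 5.0, 6.0,
         7.0, 8.0, 9.0, 10.0, 11.0, 12.0]


def _fmt(hours: float) -> str:
    """Format 2.0 as '2h' and 1.5 as '1.5h' to match the answer labels."""
    return f"{hours:g}h"


def bucket_for(hours: float) -> str:
    """Return the answer label for a horizon, with [left, right) intervals."""
    if hours < EDGES[0]:
        return "<1.5h"
    if hours >= EDGES[-1]:
        return ">=12h"
    for lo, hi in zip(EDGES, EDGES[1:]):
        if lo <= hours < hi:  # left bound inclusive, right bound exclusive
            return f"{_fmt(lo)} - {_fmt(hi)}"
    raise ValueError("unreachable for finite inputs")


def resolve(horizons_hours: list[float]) -> str:
    """Resolve to the bucket of the highest reported horizon among
    qualifying models, or 'N/A' if none were evaluated."""
    if not horizons_hours:
        return "N/A"
    return bucket_for(max(horizons_hours))
```

Under this reading, a reported horizon of exactly 4 hours falls in "4h - 5h", not "3.5h - 4h".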

🏅 Top traders

#	Trader	Total profit
1		Ṁ2,205
2		Ṁ1,910
3		Ṁ988
4		Ṁ781
5		Ṁ719

People are also trading

Gemini 3.1 Pro METR 50% time horizon

Gemini 3 Pro GA METR 50% time horizon

GPT 5.4 METR 50% time horizon

Grok 4.20 METR 50% time horizon

R2 / V4-Thinking METR 50% time horizon

GPT 5.2 Pro METR time horizon

Claude Sonnet 4.6 METR 50% time horizon

Grok 5 METR 50% time horizon

Kimi K3 Thinking METR 50% time horizon

Best METR 50% Time Horizon in 2026

48 Comments

163 Holders

2.4k Trades

bought Ṁ10 YES

@traders just in case you are not aware, if METR ends up not evaluating the current version of gemini 3, this market will unfortunately resolve N/A. I think it would be worthwhile to create a version that resolves to whatever version does get evaluated though (edit: https://manifold.markets/Bayesian/gemini-3-pro-metr-50-time-horizon)

jim on day of release

close one!

@Bayesian resolves N/A

@jim we can wait for GA, can't we?

The members of the AI futures project have given an update and they appear to now be relying on the 80% time horizon length graph from METR for their predictions rather than the 50% time horizon length graph. This implies that a 50% time horizon is not enough. While I think markets for 50% time horizons are useful, I now think that more attention needs to be paid to 80% time horizon lengths.

@MaxLennartson Source: https://www.aifuturesmodel.com/#section-timehorizonandtheautomatedcodermilestone

bought Ṁ10 YES

@Bayesian I think they will evaluate this model. They implied that they would do so soon. However, it sounds like they are doing other models first. Given that Gemini came out in November and that METR is waiting for general access, they probably won’t evaluate the model until sometime in January. I predict that METR will evaluate GPT 5.2 first followed by Grok 4.1 and then Gemini 3.

@MaxLennartson Gemini 3 Pro is in preview, not GA. If they wait for GA to evaluate, and Google deprecates inference for this preview version, they might be evaluating the GA version rather than the current version (preview).

@lumi If they evaluate general access, this market resolves N/A

@Bayesian I personally think this market will resolve N/A because I think METR will wait for general access like they did for Gemini 2.5.

likewise

This is interesting https://x.com/EpochAIResearch/status/1999585226989928650?s=20

@Jolliest

https://x.com/YafahEdelman/status/2002221434270331288

@Bayesian Hello, why did the market get closed?

why did it get closed??? there is no answer wtf

@Amonium bc the close date was set too soon. fixed

bought Ṁ20 YES

@Bayesian Thank you.

Why are we all on hold?!

https://x.com/GregHBurnham/status/1993509024097292388?s=20

some useful references here perhaps

How does this resolve if METR doesn't evaluate any Gemini 3 model which is released within a month?

@jim I think the “within a month” thing means any model of Gemini’s released within a month of the first announcement, not METR’s analysis

@bens yes, but it's not guaranteed that any Gemini models which meet this condition will be evaluated by METR.

opened a Ṁ25,000 NO at 1.0% order

@jim i'll bet they will

but if they don't then ~~obviously it would resolve to <1.5h~~ jk it would resolve N/A

oh no they probably won't that's devastating i forgot they waited for general access before testing gemini 2.5 pro

sold Ṁ176 NO

@Bayesian yeah

opened a Ṁ7 YES at 28% order

@Bayesian what do you mean by general access?

@MaxLennartson The currently available model is gemini 3 pro preview. General access is when they remove all modifiers and actually call the model gemini 3 pro in the api and such

@Bayesian It looks like they are calling it Gemini 3 pro.

@MaxLennartson They’re calling it that to customers to keep it simple, but to devs they’re calling it gemini 3 pro preview

@Bayesian How long did it take before Gemini 2.5 became general access?

@MaxLennartson around 2-3 months iirc

@Bayesian yeah 2 months from 2.5 preview (but there was 2.5 experimental before that)

bought Ṁ10 YES

@Bayesian Do you think that METR will evaluate the ai models that have been released recently including Gemini 3?

@MaxLennartson not gemini 3, probably opus 4.5 though

@Bayesian Well I would assume that they are probably waiting for Gemini 3 to become general access.
