MANIFOLD
New METR SOTA by EOY?
Resolved YES on Dec 18

Resolves YES if any model surpasses a 50% time-horizon of 2h 42m on https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/ by EOY

  • Update 2025-12-12 (PST) (AI summary of creator comment): The model must be scored by end of year (not just released by end of year).

  • Update 2025-12-18 (PST) (AI summary of creator comment): Creator states that the GPT 5.1 - codex max score qualifies for a YES resolution.
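
For anyone who wants to check the arithmetic, here is a rough sketch of the resolution rule as described above: the score must exceed 2h 42m and must be published by end of year. The function names, the 2025-12-31 deadline, and the 2025-12-18 score date used in the example are assumptions for illustration (the date is simply the day the market resolved), not anything taken from METR or Manifold.

```python
from datetime import date

def to_minutes(hours: int, minutes: int) -> int:
    """Convert an hours/minutes time horizon into total minutes."""
    return hours * 60 + minutes

# Threshold from the market description: a 50% time horizon of 2h 42m.
THRESHOLD_MIN = to_minutes(2, 42)

# Assumption: "EOY" means the last day of 2025, the year of the updates.
DEADLINE = date(2025, 12, 31)

def resolves_yes(horizon_min: int, scored_on: date) -> bool:
    """YES only if the score strictly exceeds the threshold
    and was published on or before the deadline."""
    return horizon_min > THRESHOLD_MIN and scored_on <= DEADLINE

# Score quoted in the comments: 2 hours 53 minutes.
score = to_minutes(2, 53)
margin = score - THRESHOLD_MIN  # minutes above the threshold

# Assumption: the score date is taken to be the resolution date.
print(margin, resolves_yes(score, date(2025, 12, 18)))
```

Under these assumptions the quoted score clears the threshold by 11 minutes, which is consistent with the YES resolution.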


bought Ṁ300 YES

@creator updated data is out; GPT 5.1 - codex max is now 2 hours 53 minutes

may seem like a cheap win but per the market terms I think it qualifies.

@MRME yeah could go either way on this one but YES resolution seems best given the wording of the question and description

Does it have to be scored by the end of the year, or does the model have to be released by the end of the year?

@BenAybar scored
