MANIFOLD
Model scores over 50% on GDPVal by EOY 2025?
20
Ṁ1kṀ5.8k
resolved Jan 1
Resolved
YES

Resolves to YES if any model released in 2025 gets over 50% on the newly released GDPVal by April 1, 2026.

Resolves to NO if not.

Resolves to NO as early as January 1, 2026 if there is no plausible model that has not yet been evaluated, that might exceed 50%.

  • Update 2025-12-28 (PST) (AI summary of creator comment): - The scoring method will count wins+ties (not just wins), following OpenAI's official scoring methodology

    • Over 50% wins+ties will be sufficient for YES resolution

    • Creator plans to resolve YES on January 1 based on GPT-5.2-Pro results unless objections are raised

Market context
Get
Ṁ1,000
to start trading!

🏅 Top traders

#TraderTotal profit
1Ṁ185
2Ṁ135
3Ṁ126
4Ṁ98
5Ṁ94
Sort by:

OpenAI publishes official scores that count wins+ties, so this market will do the same. Over 50% wins+ties will be sufficient.

Unless someone argues why I shouldn't, based on GPT-5.2-Pro I will resolve this YES on January 1.

@ZviMowshowitz can we please get clarification on if this is wins only or wins+ties?

@creator counting wins only? or wins+ties?

OpenAI said they jumped from 38.8% to 70.9%

Introducing GPT-5.2 | OpenAI

OpenAI's blog post and paper on GDPval for reference:

https://openai.com/index/gdpval/
https://cdn.openai.com/pdf/d5eb7428-c4e9-4a33-bd86-86dd4bcf12ce/GDPval.pdf

I'm slightly impressed that OpenAI created a new benchmark on which Claude beats them.

it's 2.4% away and it has now become a target

You mean gdpval? I can update the title if you want

@ian I'll just go ahead

© Manifold Markets, Inc.TermsPrivacy