Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?

Ṁ1kṀ803

resolved Sep 16

Resolved

YES

ALL

Resolve to YES if OpenAI's next generation language model scores 65% or higher on the GPQA benchmark(extended set).

If OpenAI's existing model gets 65% or higher by post-training enhancements, that also counts.

There's room for improvement via prompt engineering after the release, but I don't know how long I should wait, so I will resolve this question as soon as OpenAI releases their next model.

Market context

GPT-5 Capabilities

Technology

Technical AI Timelines

OpenAI

Science

Get

1,000

to start trading!

🏅 Top traders

#	Trader	Total profit
1		Ṁ168
2		Ṁ68
3		Ṁ23
4		Ṁ22
5		Ṁ11

People are also trading

Open-Source AI model gets perfect IMO 2026 score? [International Math Olympiad 2026]

-4% 1d37% chance

In what year will AI achieve a score of 95% or higher on the GPQA benchmark?

8/14/26

Will the gap between open-weights and frontier models on GPQA Diamond be at most 7%?

6% chance

Will any AI model score above 95% on ARC-AGI-2 by end of 2026?

90% chance

What will be the best OpenAI-Proof Q&A score by Dec 31, 2026?

Will OpenAI release a model called GPT-5o in 2026?

2% chance

Will OpenAI's o4 get above 50% on humanity's last exam?

16% chance

🏅 Top traders

People are also trading

Related questions