Which Benchmarks will OpenAI show results from GPT-5 on, when it is announced?
21
1.1kṀ9104
resolved Sep 29
Resolved
YES
SimpleQA
Resolved
YES
HumanEval
Resolved
YES
MMLU
Resolved
YES
GPQA
Resolved
YES
SWE-Bench
Resolved
YES
ARC-AGI-2
Resolved
NO
GSM8K
Resolved
NO
MATH
Resolved
NO
MGSM
Resolved
NO
DROP
Resolved
NO
Big-Bench-Hard

Some flexibility on variations of specific benchmarks. eg SWE-Bench-Hard would resolve SWE-Bench YES.

  • Update 2025-05-11 (PST) (AI summary of creator comment): The benchmarks must be those that GPT-5 is benchmarked against by OpenAI.

Must be on roughly the same day / during / around the time of the announcement. If there are several announcements over multiple days, all those times are acceptable for the purpose of this market.

Get
Ṁ1,000
to start trading!

🏅 Top traders

#NameTotal profit
1Ṁ341
2Ṁ160
3Ṁ100
4Ṁ71
5Ṁ61
Sort by:

@Bayesian resolve please

@traders has anyone seen them announce scores on the current NO ones? If so I'll reresolve

bought Ṁ10 NO

you mean benchmarked by OpenAI?

@bbb I can't add options, I might create a duplicate where i can in a bit.

bought Ṁ30 NO

@bbb Idk if i was actually able to change the settings back then but since then ive learned how to do it, so added arc agi 2

© Manifold Markets, Inc.TermsPrivacy