
Which Benchmarks will OpenAI show results from GPT-5 on, when it is announced?
21
1.1kṀ9104resolved Sep 29
Resolved
YESSimpleQA
Resolved
YESHumanEval
Resolved
YESMMLU
Resolved
YESGPQA
Resolved
YESSWE-Bench
Resolved
YESARC-AGI-2
Resolved
NOGSM8K
Resolved
NOMATH
Resolved
NOMGSM
Resolved
NODROP
Resolved
NOBig-Bench-Hard
Some flexibility on variations of specific benchmarks. eg SWE-Bench-Hard would resolve SWE-Bench YES.
Update 2025-05-11 (PST) (AI summary of creator comment): The benchmarks must be those that GPT-5 is benchmarked against by OpenAI.
Must be on roughly the same day / during / around the time of the announcement. If there are several announcements over multiple days, all those times are acceptable for the purpose of this market.
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ341 | |
2 | Ṁ160 | |
3 | Ṁ100 | |
4 | Ṁ71 | |
5 | Ṁ61 |
People are also trading
Sort by:
@bbb Idk if i was actually able to change the settings back then but since then ive learned how to do it, so added arc agi 2