Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
10
69
260
2027
65%
chance

Resolve to YES if OpenAI's next generation language model scores 65% or higher on the GPQA benchmark(extended set).

If OpenAI's existing model gets 65% or higher by post-training enhancements, that also counts.

There's room for improvement via prompt engineering after the release, but I don't know how long I should wait, so I will resolve this question as soon as OpenAI releases their next model.

Get Ṁ200 play money

More related questions