Will I judge GPT-5 to be smarter than o1 (not preview) after both are released?
20
1kṀ4175
resolved Aug 23
Resolved
YES

Resolves subjectively, based on my analysis of benchmarks both official, third party, and my own.

Some examples of benchmarks I consider are MMLU, ZebraLogic, SWE-bench, simplebench, ARC, and livebench.

Some of my own evals are game-playing (tic-tac-toe, and connect 4), and creative writing (giving a model 3 random nouns and asking it to write a story involving them)

Get
Ṁ1,000
to start trading!

🏅 Top traders

#NameTotal profit
1Ṁ147
2Ṁ138
3Ṁ73
4Ṁ67
5Ṁ44
Sort by:

What do you do if no model named GPT 5 will be released, but instead they continue with the oN scheme for all their models?

@yetforever Resolves n/a. Though a departed researcher already described working on GPT-5 so I would be surprised if that happened

© Manifold Markets, Inc.TermsPrivacy