Before 2026, What will be true of OpenAI's Claimed IMO Gold Performance?
4
175Ṁ76
Dec 31
88%
At least one former IMO medalist will review the model's answers and claim it did not actually achieve Gold
50%
The breakthrough is mostly the result of superior test-time scaling methods
50%
The model that achieved it could earn at least bronze using no more than 100,000 reasoning tokens per question
50%
The model that achieved it was trained with a new reinforcement learning algorithm
50%
It was achieved with the same model OpenAI used to get second place in AtCoder
34%
It was achieved by a model that does not use a standard transformer architecture
34%
I will consider the techniques used to achieve it at least as big of a breakthrough as strawberry

Get
Ṁ1,000
to start trading!
© Manifold Markets, Inc.TermsPrivacy