This was shared across twitter.

Will it be confirmed real or completely made up? If they announce benchmark results that are all better than 1% less than the current leaked results, this market resolves YES.
It also resolves yes if the benchmark results were obtained with pass@1024 or something like that
Update 2025-07-01 (PST) (AI summary of creator comment): This market now refers to Grok 4, which the creator considers a rename of the model previously referred to as Grok 3.5.
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ952 | |
2 | Ṁ45 | |
3 | Ṁ14 | |
4 | Ṁ11 | |
5 | Ṁ11 |
People are also trading
the original resolution criteria was
Will it be confirmed real or completely made up? If they announce benchmark results within 1% of each of these except one which can be within 2%, even if they announce the results were for pass@64 or any other not ‘apples to apples’ comparison like that, the market resolves yes.
But when i updated it i accidentally removed the part that would have answered your question