
I will run a Manifold poll 1 month after the official GPT-5 release asking whether or not it exceeded expectations. Resolves to results of that poll.
Update 2025-22-01 (PST): - The market closure date has been extended to July 1, 2025 following updates on the GPT-5 release schedule. (AI summary of creator comment)
People are also trading


All that for +21 Elo score?
@TheAllMemeingEye bruh it's only 257 above GPT-3.5 from way back in 2022-23, so almost a fifth of the time the model from years ago will beat it XD

@TheAllMemeingEye It (or a classifier) chooses how much reasoning to devote to a query.
Not an actual technical term as far as I know
If anyone wants to keep trading https://manifold.markets/Thomas42/how-will-gpt-5-exceeds-expectations
EDIT: Poll will be posted in one month (per description)
Closed so no arbitrage possible
https://manifold.markets/PhilosophyBear/when-gpt5-comes-out-will-more-manif
https://manifold.markets/strutheo/will-nvidia-stock-nvda-be-worth-188
This market should be correlated with this one, no?
@jessald Very loosely at best, I hardly expect that the performance of one model is going to have a big effect on Nvidia's stock
In an interview Dario said that the average lay person wouldn't be able to appreciate an AI with for example PhD level chemistry knowledge because they simply wouldn't understand or even ask for information that would make use of or reveal this knowledge I feel like gpt5 could be an iceberg and most of the respondents in the future poll will only see the surface.
@jessald Everyone knows something & they could test it on that. That works if you're a car mechanic too. Besides, you'd hope to see better instruction following
@jessald Dario's point is 100% true in theory but in practice I feel the opposite has happened, at least in some forums: people unable to actually judge models' performance in, say, advanced mathematics, go around claiming models are way better than they actually are.
@NBAP They should use 4.5, no?
Everyone just memory-holed 4.5! It's such an awful model, I keep trying to use it and it really can't hold a candle to o3 and it's just as slow.