GPT 5 Exceeds Expectations
342
1kṀ49k
Aug 7
45%
chance

I will run a Manifold poll 1 month after the official GPT-5 release asking whether or not it exceeded expectations. Resolves to results of that poll.

  • Update 2025-22-01 (PST): - The market closure date has been extended to July 1, 2025 following updates on the GPT-5 release schedule. (AI summary of creator comment)

Get
Ṁ1,000
to start trading!
Sort by:

making only 7 mana profit here but winning is winning

I don't know what people's expectations were, exactly, but this is my take:

It seems pretty clear that GPT-5 was not a GPT-3.5 -> GPT-4 moment, but it's good enough at pair programming that it has stolen me back from Gemini 2.5 Pro.

All that for +21 Elo score?

@TheAllMemeingEye bruh it's only 257 above GPT-3.5 from way back in 2022-23, so almost a fifth of the time the model from years ago will beat it XD

I have a take:

It's just o3 RL'ed to have variable thinking. It doesn't feel like a new model at all

@FergusArgyll what does variable thinking mean in this context?

@TheAllMemeingEye It (or a classifier) chooses how much reasoning to devote to a query.

Not an actual technical term as far as I know

EDIT: Poll will be posted in one month (per description)

lmao those are some interesting numbers with that chart

@Balasar chart made by GPT-5

@Balasar Holy shit that's misleading lol

Is there a similar market, but for GPT 5 being underwhelming?

bought Ṁ50 NO

https://manifold.markets/strutheo/will-nvidia-stock-nvda-be-worth-188
This market should be correlated with this one, no?

@jessald Very loosely at best, I hardly expect that the performance of one model is going to have a big effect on Nvidia's stock

bought Ṁ200 NO

Another factor to consider is the demographics of the voters in the future survey.
I wonder if they might be biased?

bought Ṁ935 YES

let’s roll

bought Ṁ250 NO

In an interview Dario said that the average lay person wouldn't be able to appreciate an AI with for example PhD level chemistry knowledge because they simply wouldn't understand or even ask for information that would make use of or reveal this knowledge I feel like gpt5 could be an iceberg and most of the respondents in the future poll will only see the surface.

@jessald Everyone knows something & they could test it on that. That works if you're a car mechanic too. Besides, you'd hope to see better instruction following

@jessald Dario's point is 100% true in theory but in practice I feel the opposite has happened, at least in some forums: people unable to actually judge models' performance in, say, advanced mathematics, go around claiming models are way better than they actually are.

bought Ṁ100 NO

My read is that people are generally expecting a qualitative jump similar to the jump from GPT 3.5 to 4, but that they will use 4o or o3 as the comparison, rather than the original 4.

@NBAP They should use 4.5, no?

Everyone just memory-holed 4.5! It's such an awful model, I keep trying to use it and it really can't hold a candle to o3 and it's just as slow.

© Manifold Markets, Inc.TermsPrivacy