What is the main reason behind GPT-4o speed improvement relative to GPT-4 base model? | Manifold

25

1kṀ2143

2029

82%: Smaller model size (hence, architecture/algorithm improvements)
74%: More/better hardware allocated
40%: Something related to low-level computation efficiency (for example, optimized frameworks)
23%: Other
15%: Better coarser-grained tokenizer

GPT-4 speculation

People are also trading

Will GPT-5 be released incrementally as GPT4.x for different checkpoints from the training run?

Will GPT-5 have fewer parameters than GPT-4? (1500M subsidy)

Will the performance jump from GPT4->GPT5 be less than the one from GPT3->GPT4?

What will the aggregate improvement of GPT5 be over GPT4 in terms of metrics?

Is GPT-4.5 the base model for o3?

Was GPT-4 trained in 4 months or less?

GPT-4 performance and compute efficiency from a simple architecture before 2026

When will an open-source LLM be released with a better performance than GPT-4?

How many parameters does GPT4o have?

@IhorKendiukhov No option for increased sparsity? That's architectural, but doesn't imply a smaller model size.
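
To make the sparsity point concrete, here is a minimal sketch of how a sparse mixture-of-experts layout can have more total parameters than a dense model while using fewer of them per token. Every number and the helper name `moe_param_counts` are hypothetical, chosen only for illustration; none of them are figures for GPT-4 or GPT-4o.

```python
# Toy arithmetic only: all sizes below are made-up assumptions,
# not known parameter counts for any OpenAI model.

def moe_param_counts(n_experts: int, active_experts: int,
                     params_per_expert: int, shared_params: int) -> tuple[int, int]:
    """Return (total, active-per-token) parameter counts for a toy model.

    shared_params covers everything every token passes through
    (attention, embeddings); each token is routed to `active_experts`
    of the `n_experts` expert blocks.
    """
    total = shared_params + n_experts * params_per_expert
    active = shared_params + active_experts * params_per_expert
    return total, active


# A dense model is the special case of one expert that is always active.
dense_total, dense_active = moe_param_counts(1, 1, 80_000_000_000, 20_000_000_000)

# A sparser layout: 16 smaller experts, 2 routed per token.
sparse_total, sparse_active = moe_param_counts(16, 2, 10_000_000_000, 20_000_000_000)

print(f"dense:  total={dense_total:,} active={dense_active:,}")
print(f"sparse: total={sparse_total:,} active={sparse_active:,}")
```

With these assumed sizes the sparse layout is larger overall but touches fewer parameters per token, which is why "increased sparsity" would not fit cleanly under "smaller model size".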

Did they answer this for gpt-4-turbo?

"Main reason" implies only one of these resolves YES. If the coarse tokens help but aren't the central boost, I assume that option resolves NO?

@MaxHarms Correct. In very ambiguous situations, two options may resolve YES, but I expect this to happen only in extreme cases.

It's likely a combination of a smaller model and low-level optimisations (these happen all the time, judging by open-source solutions). However, I find it unlikely that "open" AI will share the exact numbers needed to determine which factor played the biggest role.

bought Ṁ10 YES

What does quantization count towards?

@Sss19971997 Quantization itself would count as "something related to low-level computation efficiency".

Which base model?

@StephenMWalkerII The first publicly available GPT-4 model released on March 14, 2023.
