By 2028, will there be a language model of less than 10B parameters that is superior to GPT-4?
Resolved YES (May 14)

Pretty impressed by this:

https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/

I will not bet in this market. I will look primarily at whatever benchmarks are most prominent at the time to determine which model is better.

The model does not have to be clearly superior to GPT-4; if it's just a bit better, I will resolve YES. If anyone has suggestions for less subjective resolution criteria that won't be Goodharted, I am open.



@SemioticRivalry I just noticed this question seems to have some similarity to this other one:

/_deleted_/will-a-15-billion-or-less-parameter

You seem to have a lot of knowledge about this subject and I very much do NOT. Would you be able to review my chain of comments in the other market and confirm them or refute them, and also see if they apply in any way to your own market here?

(My comments there are just completely flailing around; I have no idea what any of it means. Even though the market is resolved, I am very willing to reconsider if someone knows more about the subject.)

@Eliza yeah I think this is sufficient

Gemma 2 9B-it ranks higher than the original ChatGPT on the LMSYS arena. I don't think it's better than GPT-4, or even on the same level, but I think it's a signal that it's already close. The 27B version is, I think, on the same level.
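For context on what "higher on the arena" means: arena-style leaderboards turn pairwise human votes into a ranking scored roughly like chess Elo ratings (the actual LMSYS pipeline fits a Bradley-Terry model, so the sketch below is only an approximation of the idea, with made-up ratings):

```python
# Minimal sketch of an Elo-style update, approximating how arena
# leaderboards turn head-to-head votes into a model ranking.
# (LMSYS actually fits a Bradley-Terry model; this is illustrative.)

def expected_score(r_a: float, r_b: float) -> float:
    """Probability that model A beats model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))

def elo_update(r_a: float, r_b: float, a_won: bool, k: float = 32.0):
    """Return both models' updated ratings after one pairwise vote."""
    delta = k * ((1.0 if a_won else 0.0) - expected_score(r_a, r_b))
    return r_a + delta, r_b - delta

# Hypothetical example: a small model upsets a higher-rated one,
# so it gains rating points and the larger model loses them.
small, big = 1150.0, 1200.0
small, big = elo_update(small, big, a_won=True)
print(f"small: {small:.1f}, big: {big:.1f}")
```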


If the model can be stored in 10B parameters but takes 100,000x more time at inference, because it searches through lots of possibilities, or re-prompts itself a bunch of times, or uses similar tricks to improve its output, would that still count? In 2023 it would still not be practical to run, but it would still be smaller in the '# of parameters' sense.

@TheBayesian I'll allow it; parameters are the only variable of interest
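A minimal sketch of the kind of inference-time trick described above is best-of-N sampling: query a small model many times and keep the highest-scoring candidate, trading compute for quality while the parameter count stays fixed. `generate` and `score` here are hypothetical placeholders for a small model's sampling call and a verifier or reward model, not any real API:

```python
import random

# Hedged sketch of best-of-N sampling: spend ~N times more inference
# compute to improve output quality without adding any parameters.

def generate(prompt: str, seed: int) -> str:
    """Placeholder for one stochastic sample from a <10B-parameter model."""
    random.seed(seed)
    return f"candidate answer {random.randint(0, 999)}"

def score(prompt: str, answer: str) -> float:
    """Placeholder for a verifier / reward model scoring a candidate."""
    random.seed(hash((prompt, answer)) % (2**32))
    return random.random()

def best_of_n(prompt: str, n: int = 64) -> str:
    """Sample n candidates and keep the one the scorer likes best.
    Inference cost grows roughly n-fold; model size is unchanged."""
    candidates = (generate(prompt, seed=i) for i in range(n))
    return max(candidates, key=lambda ans: score(prompt, ans))

print(best_of_n("Is a 10B model enough?", n=8))
```

Self-consistency (majority voting over sampled answers) and tree search over reasoning steps are variants of the same compute-for-quality trade-off, which is why the per-parameter comparison the comment raises is a real ambiguity.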
