Will a single model running on a single consumer GPU (<1.5k 2020 USD) outperform GPT-3 175B on all benchmarks in the original paper by 2025?
27
117
αΉ3.9KαΉ431
2026
85%
chance
1D
1W
1M
ALL
There are no restrictions on the amount or kind of compute used to *train* the model. Question is about whether it will actually be done, not whether it will be possible in theory. If I judge the model to really be many specific models stuck together to look like one general model it will not count.
Get αΉ200 play money
Related questions
Sort by:
Llamas on pixel 7s https://github.com/rupeshs/alpaca.cpp/tree/linux-android-build-support (ik ik its not over 13B yet, just sharing progress)
@ValeryCherepanov By "run on a single GPU" I mean the weights + one full input vector can fit on a consumer GPU at once. Otherwise the question would be meaningless - you can always split up matrices into smaller blocks and run the computation sequentially.
More related questions
Related questions
Will there be a model that has a 75% win rate against the latest iteration of GPT-4 as of January 1st, 2025?
47% chance
Will an open source model beat GPT-4 in 2024?
65% chance
When will a GPT-4 class model run on a single consumer PC?
Will any Google model exceed chatGPT interest? (by 2025)
29% chance
Will a model be trained using at least as much compute as GPT-3 using AMD GPUs before Jan 1 2026?
71% chance
Will we have an open-source model better than GPT-4-Turbo before 2025?
62% chance
Will any Deepmind model exceed chatGPT interest? (by 2025)
28% chance
Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?
55% chance
Will any open-source model achieve GPT-4 level performance on MMLU through 2024?
83% chance
Will there be an AI language model that surpasses ChatGPT and other OpenAI models before the end of 2025?
64% chance