In what year will a GPT3-equivalent model be able to run on consumer hardware?

370Ṁ830

resolved Apr 3

Resolved

2023

ALL

"Consumer hardware" is defined as costing no more than $3,000 USD for everything that goes inside the case (not including peripherals).

In terms of "GPT3-equivalent model," I'll go with whatever popular consensus seems to indicate the top benchmarks (up to three) are regarding performance. The performance metrics should be within 10% of GPT3's. In the absence of suitable benchmarks I'll make an educated guess come resolution time after consulting educated experts on the subject.

All that's necessary is for the model to run inference, and it doesn't matter how long it takes to generate output so long as you can type in a prompt and get a reply in less than 24 hours. So in the case GPT3's weights are released and someone is able to shrink that model down to run on consumer hardware and get any output at all in less than a day, and the performance of the output meets benchmarks, this resolves to whatever year that first happens in.

Get

1,000

to start trading!

🏅 Top traders

#	Name	Total profit
1		Ṁ13
2		Ṁ6
3		Ṁ4
4		Ṁ0

People are also trading

In what year will a GPT4-equivalent model be able to run on consumer hardware?

2026

In what year will a GPT4-equivalent model be able to run on consumer hardware?

2026

Will a single model running on a single consumer GPU (<1.5k 2020 USD) outperform GPT-3 175B on all benchmarks in the original paper by 2025?

86% chance

Will a model be trained using at least as much compute as GPT-3 using AMD GPUs before Jan 1 2026?

80% chance

Will a model as great as GPT-5 be available to the public in 2025?

99% chance

Will we have an open-source model that is equivalent GPT-4 by end of 2025?

96% chance

GPT-5 level model runnable on phones by 2030?

41% chance

GPT-4 performance and compute efficiency from a simple architecture before 2026

19% chance

Will a GPT-3 quality model be trained for under $1,000 by 2030?

87% chance

Will a GPT-3 quality model be trained for under $10.000 by 2030?

Sort by:

I've seen enough. Not sure if anything that runs locally is GPT-3.*5* equivalent, but it seems clear we have GPT-3 equivalent stuff right now. And llama.cpp has had a lot of advancements just in the last week in terms of running even faster locally.

predictedLOWER

@firstuserhere

1. "In our experiments, we employed the gpt-3.5-turbo and text-davinci-003 variants of the GPT models as the large language models"

2. "we use ChatGPT to conduct task planning when receiving a user request, select models according to their function descriptions available in HuggingFace, execute each subtask with the selected AI model, and summarize the response according to the execution results."

3. "We built HuggingGPT to tackle generalized AI tasks by integrating the HuggingFace hub with 400+ task-specific models around ChatGPT."