Will there be a gpt-4 quality LLM with distributed inference by the end of 2024?
Resolved NO (Jan 2)

  • The model has an Elo greater than 1190 on ChatbotArena (or, if ChatbotArena is no longer available/updating, achieves GPT-4 (03.14) equivalent or greater performance on both MMLU and MT-Bench)

  • When running inference in a geographically distributed fashion (the computational hardware is not colocated, and is networked over typical consumer equipment)

  • on heterogeneous hardware (the computational hardware is varied in type, e.g. different GPU models)

  • without the act of distributed inference causing the model to require 2 OOM (100×) more energy usage (e.g. if doing so is incredibly lossy and inefficient, it does not count; the burden of proof lies on anyone claiming this clause should be activated; a rough sketch of this check appears below the list)

Note: if (or, when) an edge case is presented, its applicability to this question will be evaluated against my and Robert's understanding of the spirit of the question.
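To make the energy clause concrete, here is a minimal sketch (not an official test) of how the 2 OOM threshold could be checked; the per-token energy figures are hypothetical placeholders, not measurements:

```python
# Hedged sketch: checks the market's "no more than 2 OOM extra energy" clause.
# The per-token energy values below are hypothetical placeholders for illustration.

COLOCATED_J_PER_TOKEN = 0.5     # hypothetical: joules per generated token, colocated inference
DISTRIBUTED_J_PER_TOKEN = 12.0  # hypothetical: joules per generated token, distributed inference

MAX_OVERHEAD = 100.0  # 2 orders of magnitude

overhead = DISTRIBUTED_J_PER_TOKEN / COLOCATED_J_PER_TOKEN
clause_triggered = overhead >= MAX_OVERHEAD  # True would mean the clause blocks a YES resolution

print(f"Energy overhead: {overhead:.1f}x; clause triggered: {clause_triggered}")
```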




bought Ṁ2,000 NO · 5mo

Resolves no. Creator deleted account @mods

predicted YES · 1y

@firstuserhere I'm curious what updated your prediction from 50% to 30%?

1y

Useful markets for betting:

1y

I'd say a rough draft could be (with GPT-4 performance as a baseline):

  1. The model has an Elo greater than 1190 on ChatbotArena (or, if ChatbotArena is no longer available/updating, achieves GPT-4 (03.14) equivalent or greater performance on both MMLU and MT-Bench)

  2. when running inference in a geographically distributed fashion (the computational hardware is not colocated, and is networked over typical consumer equipment)

  3. on heterogeneous hardware (the computational hardware is varied in type, e.g. different GPU models).

  4. without the act of distributed inference causing the model to require 2 OOM more energy usage (e.g. if doing so is incredibly lossy and inefficient, it does not count; the burden of proof lies on anyone claiming this clause should be activated).

  5. Note: if (or, when) an edge case is presented, its applicability to this question will be evaluated against my understanding of the spirit of the question (is it possible to run inference on a SOTA LLM using a bunch of different people's computers?).

Note: @firstuserhere my knowledge of LLMs is definitely in the midwit realm; if any of the above clauses (benchmarks?) seem bad, feel free to improve them. I do suggest we have this question evaluate GPT-4 or equivalent models, though, as from my read-through of the paper, they already have LLaMA 70B running in a distributed fashion, which is pretty close to GPT-3.5.
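For context, a minimal sketch of what client-side distributed inference looks like, assuming the paper referenced above is describing something like the Petals library (the model name and prompt are illustrative, and this is not the market's official test):

```python
# Hedged sketch: Petals-style distributed inference, where transformer blocks are
# served by volunteer machines (potentially heterogeneous GPUs) over the internet.
# Model name and prompt are illustrative, not part of the market's criteria.
from transformers import AutoTokenizer
from petals import AutoDistributedModelForCausalLM

model_name = "meta-llama/Llama-2-70b-chat-hf"  # assumption: a model with a public Petals swarm

tokenizer = AutoTokenizer.from_pretrained(model_name)
# Only the embeddings/LM head live locally; the transformer blocks run on remote peers.
model = AutoDistributedModelForCausalLM.from_pretrained(model_name)

input_ids = tokenizer("Distributed inference means", return_tensors="pt")["input_ids"]
output_ids = model.generate(input_ids, max_new_tokens=16)
print(tokenizer.decode(output_ids[0]))
```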

1y

[DRAFT]?

1y

@mattyb Will remove it once the resolution criteria are written
