How many organizations will have released a GPT-4-level chatbot by the end of 2024?

resolved Jan 2

Answer options: 4-5, 6-7, 8-9, 10+

On January 1, 2025, how many organizations will have released a chatbot which is ranked above (or equal to) GPT-4-0314 in the LMSYS Chatbot Arena Leaderboard? As of March 10, 2024, there are three such organizations: OpenAI, Anthropic, and Google.

The intent of this question is to measure how many companies will have OpenAI-level capacity for developing frontier AI systems (and thus how large a moat OpenAI has). For that reason, see the clarifications below on how fine-tunes of a shared base model are treated.

Details on fine-tunes of a single base model: the organization counted for a model M will be the one which pretrained the base model from which M was fine-tuned. For example, if Meta releases a LLaMA-3 model which surpasses GPT-4-0314 on the leaderboard, and then Stanford makes a fine-tuned Vicuña-3 which also surpasses GPT-4-0314, then this would only count for one organization (Meta) and not for two (Meta + Stanford). If LLaMA-3 ranks below GPT-4-0314 but Vicuña-3 ranks above, then it counts as one organization, and Meta is that organization.

Two organizations count as the same organization iff they have the same listed name in the Chatbot Arena Leaderboard.

If GPT-4-0314 no longer appears on the leaderboard on January 1, 2025, then I'll replace it with another comparable GPT-4-series model, according to my judgement. If there are no GPT-4 series models on the leaderboard or the leaderboard no longer exists, then this market will resolve N/A.
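
The counting rule above (fine-tunes are attributed to the organization that pretrained the base model, and organizations are identified by their listed name) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how the market was actually resolved: the `Entry` fields, the `base_pretrainer` attribute, the integer `rank` with 1 as best, and the sample rows are all hypothetical, and the real leaderboard does not expose fine-tune lineage in this form.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Entry:
    model: str
    rank: int                               # assumed: 1 = best on the leaderboard
    organization: str                       # name as listed on the leaderboard
    base_pretrainer: Optional[str] = None   # hypothetical field, set only for fine-tunes


def qualifying_organizations(entries, reference_model="GPT-4-0314"):
    """Return the organizations credited with a model ranked at or above the reference."""
    ref_ranks = [e.rank for e in entries if e.model == reference_model]
    if not ref_ranks:
        # The question text leaves the substitute model to the author's judgement,
        # so this sketch simply refuses to guess.
        raise ValueError(f"{reference_model} is not on the leaderboard")
    ref_rank = ref_ranks[0]
    orgs = set()
    for e in entries:
        if e.rank <= ref_rank:  # "above (or equal to)" the reference
            # A fine-tune counts for whoever pretrained its base model.
            orgs.add(e.base_pretrainer or e.organization)
    return orgs


def answer_bucket(count):
    """Map a count onto the market's answer options; None if no option covers it."""
    if count >= 10:
        return "10+"
    for low, high in ((4, 5), (6, 7), (8, 9)):
        if low <= count <= high:
            return f"{low}-{high}"
    return None


# Hypothetical rows mirroring the LLaMA-3 / Vicuña-3 example in the text.
sample = [
    Entry("model-a", 1, "Anthropic"),
    Entry("Vicuña-3", 2, "Stanford", base_pretrainer="Meta"),
    Entry("GPT-4-0314", 3, "OpenAI"),
    Entry("LLaMA-3", 4, "Meta"),
]
print(sorted(qualifying_organizations(sample)))  # ['Anthropic', 'Meta', 'OpenAI']
```

In the sample, LLaMA-3 itself ranks below the reference, but the Vicuña-3 fine-tune ranks above it, so Meta is counted once and Stanford is not counted at all, matching the rule as stated.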


This market resolves "10+". By my count, there are 16 qualifying companies: OpenAI, Google, Anthropic, Alibaba, Cohere, Tencent, Amazon, Reka AI, Meta, Zhipu AI, Nvidia, 01 AI, DeepSeek, AI21 Labs, Mistral, xAI.

I think that all of these companies trained at least one qualifying foundation model, but it's possible that I made a mistake and one of them scaffolded existing models or fine-tuned someone else's model. Nevertheless, the resolution is safely 10+.
