What organization will have the highest ELO score in the LMSYS Org Chatbot Arena Leaderboard at the end of June, 2024?

2.7kṀ56k

resolved Jul 1

100%97%

OpenAI

0.4%

Alphabet (Google)

Anthropic

0.0%

Mistral

0.0%

🏅 Top traders

#	Name	Total profit
1		Ṁ1,554
2		Ṁ300
3		Ṁ273
4		Ṁ244
5		Ṁ231

People are also trading

Will a chatbot from a Chinese company top the LMSYS leaderboard in 2025?

20% chance

Which LLM will have the highest ELO at the end of 2025 on ChatBot Arena?

Will the LMSYS Chatbot Arena still be 'a thing' in 2027, under the same evaluation method?

36% chance

Is the LMSYS chatbot arena leaderboard trustworthy?

49% chance

How does this resolve if there's an unknown chatbot at the top? (E.g., when gpt2-chatbot was of unknown origin.)

@wrhall I hadn't considered that. I think it would be best to wait for the organization to be revealed. If that doesn't happen in an acceptable amount of time, then I'd probably resolve to Other.

bought Ṁ300 YES

https://twitter.com/lmsysorg/status/1790097588399779991

https://ai.meta.com/blog/meta-llama-3/

bought Ṁ500 YES

https://x.com/lmsysorg/status/1777630133798772766

How does this resolve in the case of a tie (like currently) where the top two are within the leaderboard's margin of error and are both ranked "1"?

@benshindel Critical question that needs to be answered

@benshindel Same as last quarter, I'll break ranking ties with ELO scores. If there's still a tie then I'll resolve them 50:50.

This is because ELO scores were all that existed when I created this question.
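
The tie-breaking rule described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `resolve_by_elo`, the `(organization, score)` input format, and the sample organizations and scores are all hypothetical, not leaderboard data or the market creator's actual procedure. It ignores rank numbers and confidence intervals, picks the organization whose best model has the highest raw ELO score, and splits the resolution evenly if the top scores are exactly equal (50:50 in the two-way case).

```python
def resolve_by_elo(entries):
    """Return {organization: resolution share} based on raw ELO scores only.

    entries: iterable of (organization, elo_score) pairs, one per model.
    Hypothetical helper for illustration; rank numbers and margins of
    error are deliberately ignored, matching the rule described above.
    """
    # Keep only the best-scoring model for each organization.
    best = {}
    for org, score in entries:
        if org not in best or score > best[org]:
            best[org] = score
    if not best:
        raise ValueError("no leaderboard entries given")

    # Every organization whose best score equals the top score is a winner.
    top = max(best.values())
    winners = sorted(org for org, score in best.items() if score == top)

    # An exact tie splits the resolution evenly among the tied organizations.
    share = 1.0 / len(winners)
    return {org: share for org in winners}


# Made-up scores, for illustration only.
print(resolve_by_elo([("OrgA", 1287), ("OrgB", 1268), ("OrgA", 1250)]))
print(resolve_by_elo([("OrgA", 1280), ("OrgB", 1280), ("OrgC", 1255)]))
```

In the first call a single organization holds the top score and takes the full resolution; in the second, two organizations share an identical top score, so each receives half.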

@HankyUSA Right, but that still doesn’t answer the question. The Elo scores give a margin of error, which is why currently if you look to the left you’ll see the ranks for the top 3 models are all “1”

@benshindel I think I answered your question. I just didn't give you the answer you wanted. I'm sorry, but I feel like I should keep as close to the original meaning of the market question as I can. When I created the market question there were no ranking numbers, just ELO scores.

This is probably going to keep coming up, so I'll add some clarifications to my existing market questions. I'll also create new market questions that are explicitly about the ranking numbers. When I do, I'll let you know.

@benshindel Here's the rank # version.
