Will any open-source model rank higher than GPT-4 on ChatBot Arena in 2024? (according to Elo rating)
resolved Apr 9
Resolved
YES
I'm guessing by open source you mean the weights are freely available and not that the training code and data have to also be open source?

I saw a question titled "GPT4 or better model available for download by EOY 2024?" and liked it. Still, I wanted another one with more objective and straightforward resolution criteria.

We use a loose definition of open-source that encompasses all previous versions of Llama. In essence, if it is theoretically possible for anyone to download the weights and run the model, then it is considered open-source.

This market resolves YES if any open-source model achieves an Elo rating that ranks it higher than GPT-4 on ChatBot Arena at any point in 2024. New versions of GPT-4 do not count: the comparison is made against the earliest GPT-4 version.
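
For illustration only, the resolution rule above could be sketched as a small check over leaderboard snapshots. Everything here is an assumption made for the example: the function name `resolves_yes`, the snapshot format (a date paired with a model-to-rating mapping), and the placeholder model names. None of it comes from ChatBot Arena or Manifold.

```python
from datetime import date

def resolves_yes(snapshots, open_models, baseline):
    """Return True if any open-weights model out-rates the baseline in 2024.

    snapshots:   iterable of (date, {model_name: rating}) pairs (hypothetical format)
    open_models: set of model names whose weights anyone can download
    baseline:    name of the earliest GPT-4 entry on the leaderboard
    """
    for day, ratings in snapshots:
        # Only 2024 snapshots that actually list the baseline model count.
        if day.year != 2024 or baseline not in ratings:
            continue
        # "At any point in 2024": a single qualifying snapshot is enough.
        if any(ratings[m] > ratings[baseline] for m in open_models if m in ratings):
            return True
    return False

# Made-up ratings and placeholder names, purely to exercise the function.
example = [
    (date(2024, 1, 15), {"gpt-4-earliest": 1180, "open-model-a": 1150}),
    (date(2024, 4, 1), {"gpt-4-earliest": 1180, "open-model-a": 1190}),
]
print(resolves_yes(example, {"open-model-a"}, "gpt-4-earliest"))
```

Newer GPT-4 versions are simply ignored in this sketch, because only the single `baseline` entry is ever compared against.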

FAQ

  • What is ChatBot Arena?

    ChatBot Arena is a benchmark platform for large language models (LLMs) that ranks AI models based on their performance. It uses the Elo rating system, widely adopted in competitive games and sports, to calculate the relative skill levels of AI models. This rating system is particularly effective for pairwise comparisons between models. In ChatBot Arena, users can interact with two anonymous AI models, compare their responses side-by-side, and vote for the one they find better. This crowdsourced approach contributes to the Elo rating of each model.
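
As a rough sketch of how an Elo update works for one pairwise vote, the textbook formulas look like the code below. The scale of 400 and the K-factor of 32 are the conventional chess values and are assumptions here; ChatBot Arena's actual computation may differ, and the function names are made up for this example.

```python
def expected_score(rating_a, rating_b):
    """Probability that A beats B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))

def update_elo(rating_a, rating_b, score_a, k=32.0):
    """Return new (rating_a, rating_b) after one comparison.

    score_a is 1.0 if A's response was preferred, 0.0 if B's was,
    and 0.5 for a tie. k (assumed value) controls how far ratings move.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - exp_a)
    # B's change mirrors A's, so the sum of the two ratings is preserved.
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - exp_a))
    return new_a, new_b

# Two equally rated models; a user votes for model A.
print(update_elo(1000.0, 1000.0, 1.0))
```

With equal starting ratings the expected score is 0.5, so a single win moves the winner up by half of `k` and the loser down by the same amount. A win against a much higher-rated model moves the ratings further than a win against a lower-rated one.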

