Question resolves to the model's Elo one week after it first appears on the leaderboard.
If [OpenAI / an employee there / whatever org created it] acknowledges this model and gives it a new name (e.g., GPT-4.5), the question will resolve to that model's Elo.
If no gpt2-chatbot model is listed on the leaderboard by June 30, the question will resolve N/A.
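For anyone who wants the timing rule spelled out, here is a minimal sketch of the criteria above. The function name `resolution_date` and its parameters are made up for illustration, and the year in the usage example is an assumption, since the criteria only say "June 30".

```python
from datetime import date, timedelta
from typing import Optional


def resolution_date(first_listed: Optional[date], deadline: date) -> Optional[date]:
    """Return the date on which to read the Elo, or None for an N/A resolution.

    first_listed: the day the model (or its acknowledged, renamed version)
                  first appeared on the leaderboard; None if it never did.
    deadline:     the last day a listing still counts (June 30 in the criteria).
    """
    # Not listed at all, or listed only after the deadline: resolves N/A.
    if first_listed is None or first_listed > deadline:
        return None
    # Otherwise wait one week after the first listing before reading the Elo.
    return first_listed + timedelta(days=7)


# Hypothetical dates; the year is assumed for illustration only.
deadline = date(2024, 6, 30)
print(resolution_date(date(2024, 5, 16), deadline))
print(resolution_date(None, deadline))
```

The sketch only covers when the Elo is read. Which leaderboard entry counts as the model is still a judgment call under the criteria.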
GPT-4o is on the leaderboard. Per the resolution criteria, I will wait one week for the Elo to stabilize and then resolve on Thursday, May 23.
Based on this tweet, I expect to resolve this to the GPT-4o Elo.
@Uaaar33 Seriously. Though the actual Elo is way closer to my subjective experience (small improvement)
Looks like two new potential successors were released: "im-a-good-gpt2-chatbot" and "im-also-a-good-gpt2-chatbot". If both appear on the leaderboard, I'm inclined to resolve this question to the higher of their two Elo ratings.
Even if unacknowledged by OpenAI, I'll count these. (Resolves N/A if no successor is released.)
Let me know if you have any concerns with that.
@WilliamKiely I'm guessing this is why: "The public leaderboard will only include models that are accessible to other third parties."