What capabilities can I elicit from the "im-a-good-gpt2-chatbot" models while they are available? [Add answers]
62%
The model will suggest a bet on a market on Manifold that ultimately ends up being profitable
33%
The model will suggest a CSS stylesheet for Manifold that you will judge nicer than the default appearance (if the model is told CSS class names)
Two new models called "im-a-good-gpt2-chatbot" and "im-also-a-good-gpt2-chatbot" appeared on Chatbot Arena today: https://chat.lmsys.org/
If you create an answer in this market, I will attempt to get the model to do whatever it says. The model gets a single prompt (which may contain many messages), and I will make about 5 attempts to get the model to try the task before resolving NO.
The model will write a correct mathematical proof in Lean that validates with no errors on the first try.
FWIW if you don't want to install Lean locally (which can be a bit of a pain in the ass), you can paste the generated proof into https://live.lean-lang.org/ instead
@duck_master They give different errors, so I'll give the model the benefit of the doubt and resolve YES if it generates code that either of them validates
The model will do better on a coding task of my choice than Claude 3 Opus
Related questions
What is true about gpt2-chatbot?
Will OpenAI's next major LLM (after GPT-4) feature natural and convenient speech-to-speech capabilities?
44% chance
Will any Google model exceed chatGPT interest? (by 2025)
29% chance
What made gpt2-chatbot smarter (than GPT4)?
Will the top chatbot in 2025 "think" before responding to a difficult prompt?
60% chance
Is "gpt2-chatbot" GPT-4's Successor?
90% chance
If the top chatbot in 2025 "thinks" before responding to a difficult prompt, will its thoughts be human-interpretable?
35% chance
Will there be an AI language model that surpasses ChatGPT and other OpenAI models before the end of 2025?
66% chance
Will any Deepmind model exceed chatGPT interest? (by 2025)
28% chance