Will an image generator be capable of asking clarifying questions about an ambiguous prompt by the end of 2024?
Resolved NO (Jan 7)

In my opinion, one of the major weaknesses of current LLM-based technology is that it doesn't ask the user clarifying questions when a prompt is ambiguous or otherwise confusing to the model.

Interactive text generators like ChatGPT probably could do it more if they were trained to do so, but I'm more concerned with models that perform a specific task like "generate an image" or "generate some music."

For example, if I ask a current image generator like DALL-E 2 or Stable Diffusion to generate an image of "A woman rescuing a drowning man with a robot arm" right now, it will give me four images with random permutations of women, men, and robots in the vicinity of some water. Compositionality problems aside, this prompt is actually linguistically ambiguous, and a competent artist would want to ask "Is it the woman or the man who has the robot arm?" before producing any artwork.

So, this market will resolve YES if, before the close date, there is a publicly-available image generator that asks the user for additional clarification in some way before generating the final images when prompted with "A woman rescuing a drowning man with a robot arm" (or a similarly ambiguous prompt, if that specific prompt doesn't work for some reason). Resolves NO otherwise. I will not be betting in this market.

  • Update 2025-01-01 (PST) (AI summary of creator comment):

    • The image generator must consistently ask for clarification when presented with an ambiguous prompt, rather than only under specific instructions or rarely.

    • The clarification behavior should be the normal and expected behavior most of the time, not something that occurs intermittently (e.g., 1 out of 7 times).



I've tested various flavors of ChatGPT today, as well as Grok 2. All of them went straight to generating the image without pointing out the ambiguity. That held even when I added instructions along the lines of "resolve syntactic ambiguities in the prompt", as @HastingsGreer did a year ago. I even tried setting up a custom GPT in OpenAI's interface with instructions along the lines of "Be sure to note whether there is any ambiguity in the task and ask for clarification," and it didn't make a difference. (Also, every one of these systems decided that the woman, not the man, had the robot arm.)

So, unless there's some other image-generating interface with chat capability that I'm not aware of, this is looking like a pretty strong NO to me. But I'll leave it open for a few days in case any stakeholders want to disagree or suggest other methods of verification.

@NLeseul I don't have a ChatGPT subscription anymore, so I tried to see if I could get Claude to do it. It seems that there's some sort of regression that came with more modern instruction fine-tuning. I really had to drop all subtlety in the prompting, and this in no way generalizes.

However, Claude did in fact disambiguate it. (Afterwards, he refused to draw the picture until I answered various other questions he had about the scene, such as whether the woman was on a dock or on a boat, and what perspective to draw the scene from.)

Ah, neat. I briefly checked if Claude had normal image-generating capabilities (which apparently it doesn't), but I didn't think about trying to get it to use drawing commands.

I do see that you still have the "5/7" indicator in that screenshot... does that mean that you tried the same prompt 7 times and got worse results with the others (as with ChatGPT last year)?

I'm not too worried about the specificity of the prompt, since I assume it would be relatively simple to build an artist-focused Claude wrapper that just includes all that in a system prompt and requires only the original prompt as user input. I still feel like it's a problem if you only get this behavior in 1 attempt out of 7, though.

I wish I'd been more specific with the criteria here when I made this. What about you? Do you feel like Claude's performance here is good enough to satisfy the spirit of the market? (Same question goes for any other stakeholders reading the thread.)

@NLeseul I get this behaviour less than 1 in 7 times; the 7 is the number of times I edited this approach, but I tried several approaches. I did also try to write a Claude + Stable Diffusion web app to resolve this question YES. Writing the initial application, which took a user input, then asked follow-up questions, then wrote a prompt for Stable Diffusion based on the user input and the question answers, was very easy: in fact, Claude wrote it in about 3 tries. At this point, I bought 1,000 mana of YES.

Then I noticed that the follow-up questions were usually very bad, and nothing I did to the application's internal prompts made them reliably better. At that point, I sold all my YES.
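
For reference, a minimal sketch of the flow described above (user prompt → clarifying questions → user answers → rewritten prompt → Stable Diffusion), assuming the `anthropic` and `diffusers` Python packages; the model names and prompt wording are illustrative, not what @HastingsGreer actually used:

```python
# Sketch of a clarify-then-generate pipeline: an LLM asks clarifying questions
# about an ambiguous image prompt before a diffusion model draws anything.
# Assumes `anthropic` and `diffusers` are installed; model names are illustrative.
import anthropic
from diffusers import StableDiffusionPipeline

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

def ask_claude(prompt: str) -> str:
    """Send a single-turn prompt to Claude and return the text reply."""
    response = client.messages.create(
        model="claude-3-5-sonnet-20241022",  # illustrative model choice
        max_tokens=512,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.content[0].text

def clarify_and_generate(user_prompt: str):
    # Step 1: ask for clarifying questions about ambiguities in the prompt.
    questions = ask_claude(
        "A user wants an image of the following. List any clarifying questions "
        "you would need answered before drawing it (e.g. syntactic ambiguities). "
        f"Prompt: {user_prompt}"
    )
    print(questions)
    answers = input("Answers to the questions above: ")

    # Step 2: have the LLM write a final, unambiguous image prompt.
    final_prompt = ask_claude(
        "Write a single, unambiguous image-generation prompt for Stable Diffusion "
        f"based on this request: {user_prompt}\n"
        f"The user answered the clarifying questions as follows: {answers}\n"
        "Reply with the prompt only."
    )

    # Step 3: hand the rewritten prompt to Stable Diffusion.
    pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")
    return pipe(final_prompt).images[0]

if __name__ == "__main__":
    image = clarify_and_generate("A woman rescuing a drowning man with a robot arm")
    image.save("output.png")
```

The hard part, as noted above, isn't wiring this together; it's getting the clarifying questions in step 1 to be reliably useful.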

Perfect. That sounds like a NO to me, then, so I'll go ahead and close. Thanks for taking the time to investigate this!

@NLeseul wtf! That’s an obvious yes! @mods

“Capable of asking clarifying questions” it sure is!

@NLeseul wtf! You changed the description after the resolution date?

The crucial update (changing resolution from YES to NO) was added after the end of 2024. @mods

“The image generator must consistently ask for clarification when presented with an ambiguous prompt, rather than only under specific instructions or rarely.”

@NLeseul @mods is this gonna get fixed or no???

@mathvc I reviewed this discussion thread to see whether anything needs to be done. Here are the highlights:

  • First, let's ignore the bit about the "AI criteria update". The description changed because of a built-in site feature that automatically generates text from the creator's comments and adds it to the description, advising participants to review the creator's clarifications below. In this case, it makes no difference to how the market will resolve -- market creators have always been able to clarify their positions in more detail in response to participant questions. Whether those clarifications actually make it to the description or not, they're almost always respected.

  • Next, will moderators overturn the resolution? Based on the guidelines we are supposed to operate under, that's very unlikely. You can review the guidelines for moderators fixing resolutions if you wish, but the general idea is:

    • creators have very broad discretion over how to resolve their own markets

    • even if they resolve in a way that is questionable, they will not be overturned unless it is unambiguously, blatantly wrong

    • even if they resolve in a way that is questionable and they are a large beneficiary of the decision, they will probably get one free pass

In summary, I don't see anything out of the ordinary here and I don't think any moderator is going to overturn @NLeseul's decision to resolve No. If you were able to convince @NLeseul that it was actually Yes, then we could re-resolve, but in this case:

  • the creator asked a question they were interested in

  • people bet on it a lot

  • people asked for clarifications really late in the game

  • the creator explained more clearly what they were looking for

  • the market closed and they resolved according to their own interpretation of their own question

As long as the creator honestly resolved it according to what they were looking for when they asked the question, we're not going to intervene.

@Eliza ok thank you for reviewing this.

DALL-E 3's ChatGPT interface can do this, if you beat it over the head with it:

@HastingsGreer as you might guess from the 4/7, I had to cherry-pick the hell out of its responses to get it to "understand" the problem.

bought Ṁ500 YES

@NLeseul This should resolve this YES

@Bayesian I'm inclined not to count this one, since it adds instructions to the prompt that weren't included in the market description, since it apparently only worked 1 time out of 7 for this user even with the extra instructions, and since I haven't personally been able to reproduce it with DALL-E 3 today.

It's true that I didn't include any rules in the original description about how frequently the model should point out the ambiguity, but I feel like the intent was that it should be the normal/expected behavior most of the time (not something that happens 1/7 times with the user directly probing for it).

Happened to see this tool mentioned today: https://github.com/AntonOsika/gpt-engineer

It looks basically like an AutoGPT-style script, but it does include a phase that identifies unclear points in the specification and generates a list of clarifying questions. Image generators work very differently from GPT, of course, so this strategy wouldn't be directly adaptable. (But maybe someone could write an AutoGPT-style script that rewrites image prompts and then passes them along to an image generator?)
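
To illustrate the idea, here is a rough sketch of such a wrapper, assuming the `openai` Python SDK and DALL-E 3's image API; the model names, system prompt, and "NOTHING TO CLARIFY" convention are assumptions for the sketch, not behavior that gpt-engineer or any image generator actually ships with:

```python
# Sketch of a gpt-engineer-style "clarify" phase wrapped around a prompt-based
# image generator. Assumes the `openai` package; all prompts are illustrative.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def clarify_then_generate(user_prompt: str) -> str:
    history = [
        {"role": "system", "content": (
            "You are an illustrator's assistant. Before any drawing happens, "
            "point out one unclear or ambiguous part of the request and ask "
            "about it. When nothing is unclear, reply exactly: NOTHING TO CLARIFY."
        )},
        {"role": "user", "content": user_prompt},
    ]

    # Clarification loop, modelled on gpt-engineer's clarify step.
    while True:
        reply = client.chat.completions.create(model="gpt-4o", messages=history)
        question = reply.choices[0].message.content
        if "NOTHING TO CLARIFY" in question.upper():
            break
        history.append({"role": "assistant", "content": question})
        history.append({"role": "user", "content": input(f"{question}\n> ")})

    # Rewrite the clarified request into one unambiguous prompt, then generate.
    history.append({"role": "user", "content": (
        "Now write a single unambiguous image prompt capturing all of the above. "
        "Reply with the prompt only."
    )})
    final_prompt = client.chat.completions.create(
        model="gpt-4o", messages=history
    ).choices[0].message.content
    image = client.images.generate(model="dall-e-3", prompt=final_prompt, n=1)
    return image.data[0].url
```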
