This market will resolve as 'Yes' if, before 2025, either I or, in the case of my inactivity, a trustworthiness individual, confirms that Stable Diffusion 3 accurately generates 'blue grass, green sky' images in 9 out of 10 attempts.
Fine-Print Details
The evaluation standard is based on a clear, casual observation of predominantly blue grass and green sky, with minor color imperfections allowed (less than 5%).
Please check my previous markets before betting on this market.
Just got this image out of it for the prompt: "An alien world with blue grass in a field. The sky is a deep green, with blots of greener clouds."
Variations of the prompt also work.
@Soli in the YES market you didn't pass the prompt exactly. What sort of prompt engineering is allowed?
i think simple prompt engineering should be allowed since we allowed chatgpt to change the prompt slightly when testing dall-e but nothing super hacky
edit: this is being discussed right now
Hmm, I assumed it wouldn't be allowed since the original DALL-E 3 market said "The prompt should be copied and pasted as "blue grass, green sky," with no extra specifications allowed."
I guess it depends on what you're trying to test. If it's about usefulness, then an internal "prompt optimizer" is just a useful feature, while having to carefully fiddle with the prompt is a big waste of time and compute. If it's about the raw intelligence of the image model itself, then obviously you need to use similar prompts (either similarly optimised or unoptimised) going to the image model.
@jBosc I just found out this wasn't automatically generated (i.e. by whatever model Manifold uses), but regardless, if DALLE-3 passed the test in December surely Stable Diffusion 3 would pass it now.