
I will test image generation models by asking "Please produce an image of two hands, with the left having # fingers and the right having # fingers". The number of fingers will be between 3 and 7. For a positive resolution, any model I use will need to get 5 images right consecutively.
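The resolution rule above can be sketched in code. This is a minimal sketch, not the market creator's actual procedure: the names `build_prompt` and `resolves_yes` are hypothetical, "between 3 and 7" is assumed to be inclusive, and the streak of 5 is assumed to count anywhere in a sequence of attempts. Judging whether an image is "right" is still done by eye; the code only takes the resulting list of pass/fail judgments.

```python
# Hypothetical helper names; the prompt wording is taken verbatim from the description.
PROMPT_TEMPLATE = (
    "Please produce an image of two hands, with the left having {left} "
    "fingers and the right having {right} fingers"
)
MIN_FINGERS, MAX_FINGERS = 3, 7  # assumption: the range is inclusive
REQUIRED_STREAK = 5


def build_prompt(left: int, right: int) -> str:
    """Fill the fixed prompt with the two finger counts."""
    for n in (left, right):
        if not MIN_FINGERS <= n <= MAX_FINGERS:
            raise ValueError(f"finger count {n} is outside {MIN_FINGERS}-{MAX_FINGERS}")
    return PROMPT_TEMPLATE.format(left=left, right=right)


def resolves_yes(results: list[bool], required: int = REQUIRED_STREAK) -> bool:
    """Return True if `results` contains `required` correct images in a row.

    Assumption: the streak may start at any attempt, not only the first.
    """
    streak = 0
    for correct in results:
        streak = streak + 1 if correct else 0
        if streak >= required:
            return True
    return False
```

Under this reading, `resolves_yes([True, True, False, True, True, True, True, True])` is true, because the last five attempts are all correct, while a single miss inside every window of five keeps the result false.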
This is what I currently get:

@Jacy The text prompt itself needs to be the exact language in the description. (Which I think is pretty clear logically)
@CarsonGale I'm realizing that DALL-E via ChatGPT seemingly takes your text input and rewrites it to make it better. I'm fine with this functionality as long as my text input is verbatim.
@CarsonGale Thanks for clarifying. I see how the ChatGPT + DALL-E 3 system as a whole could be considered an "image generation model" per the resolution criteria.
@JimHays I'm not sure why DALL-E 3 led to an increase. What I hear is mostly that it seems below expectations. I guess announcements are announcements.
I at first interpreted the title to be about having the right/standard number of fingers, since that's what is usually brought up.
For the actual market, I doubt this will happen. They've gotten better at drawing hands, whether through fine-tuned SD models or whatever. However, I doubt anyone will train one specifically for this task.
I think this is entirely possible via collecting a bunch of images and prompts in your format, but I doubt anyone's going to bother.
We could potentially end up with an image generation model with significantly better generalization, but I doubt that we will before the end of this year.
Also, 5 consecutive is kind of a strong requirement.
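To put a rough number on how strong the requirement is, here is a minimal sketch. It assumes each image is an independent trial with the same per-image success probability `p` and that the model gets a single run of exactly 5 attempts. None of this is stated in the market description, and `streak_probability` is a hypothetical name.

```python
def streak_probability(p: float, required: int = 5) -> float:
    """Chance of `required` correct images in a row in one run of exactly that length.

    Assumption: trials are independent and each succeeds with probability p.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be between 0 and 1")
    return p ** required


# Compare a few illustrative per-image success rates (made-up values, not measurements).
for p in (0.5, 0.8, 0.9, 0.95):
    print(f"per-image p = {p:.2f} -> five in a row: {streak_probability(p):.3f}")
```

Under these assumptions, a model that gets 80% of individual images right passes five in a row only about 33% of the time, and even 95% per image gives roughly 77%.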
@SB1cca I think using different prompts would unequivocally be against the market description, which states: "I will test image generation models by asking 'Please produce an image of two hands, with the left having # fingers and the right having # fingers'."