
I will test image generation models by asking "Please produce an image of two hands, with the left having # fingers and the right having # fingers". The number of fingers will be between 3 and 7. For a positive resolution, any model I use will need to get 5 images right consecutively.
This is what I currently get:


If I understand the resolution criteria correctly, we're nowhere near this and haven't been making much (if any) tangible progress. On May 24, 2023, here's what I got for DALL-E 2 and Midjourney:


are you going to test models with literally the only the prompt "Please produce an image of two hands, with the left having # fingers and the right having # fingers"
or are you going to accept/test more technical prompts for models like e.g. midjourney too?

Confirmed, that text will be the literal prompt. Midjourney is acceptable as far as .models go though

As in having or showing? Should they be two five-finger hands in all cases, or “alien” hands?








