Will any image model be able to draw a pentagon before 2025?
Resolved NO on Mar 29

Current image models are terrible at this. (That was tested on DALL-E 2, but DALL-E 3 is no better.)

The image model must get the correct number of sides on at least 95% of tries per prompt. Other details do not have to be correct. Any reasonable prompt that the average mathematically-literate human would easily understand as straightforwardly asking it to draw a pentagon must be responded to correctly. I will exclude prompts that are specifically trying to be confusing to a neural network but a human would get. Anything like "draw a pentagon", "draw a 5-sided shape", "draw a 5-gon", etc. must be successful. Basically I want it to be clear that the AI "understands" what a pentagon looks like, similar to how I can say DALL-E understands what a chair looks like; it can correctly draw a chair in many different contexts and styles, even if it misunderstands related instructions like "draw a cow sitting in the chair".
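A minimal sketch of how the 95%-per-prompt criterion above could be checked, assuming each generated image has already been judged by hand for its number of sides. The function names and the sample data are hypothetical illustrations, not part of the market or any real tool; only the 95% threshold and the example prompts come from the description.

```python
from typing import Dict, List

# From the market description: correct number of sides on at least 95% of tries per prompt.
THRESHOLD = 0.95


def prompt_passes(side_counts: List[int], target_sides: int = 5,
                  threshold: float = THRESHOLD) -> bool:
    """Return True if the share of tries with the right side count meets the threshold."""
    if not side_counts:
        return False  # no tries means nothing was demonstrated
    correct = sum(1 for n in side_counts if n == target_sides)
    return correct / len(side_counts) >= threshold


def model_passes(results: Dict[str, List[int]]) -> bool:
    """Every reasonable prompt must pass on its own; one failing prompt fails the model."""
    return bool(results) and all(prompt_passes(counts) for counts in results.values())


# Hypothetical hand-counted results: number of sides seen in each generated image.
results = {
    "draw a pentagon": [5] * 20,
    "draw a 5-sided shape": [5] * 12 + [6] * 8,
}
print(model_passes(results))
```

The check is applied per prompt rather than pooled across prompts, so a model that only handles the word "pentagon" would still fail under this reading of the criterion.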

If the input is fed through an LLM or some other system before going into the image model, this pre-processing will be avoided if I can easily do so, and otherwise it will not. If the image model is not publicly available, I must be confident that its answers are not being cherry-picked.

Pretty much any neural network counts, even if it's multimodal and can output stuff other than images. A video model also counts, since video is just a bunch of images. I will ignore any special-purpose image model, like one that was trained only to generate simple polygons. It must draw the image itself, not find it online or write code to generate it. File formats that are effectively code, like SVG, don't count either; it has to be "drawing the pixels" itself.

2mo

Was this market resolved correctly??? I am salty

2mo

@Guilhermesampaiodeoliveir There wasn't any proof from the YES side.

2mo

@bohaska I bet wrong, idk if I believed this or I misbet.

2mo

What? Hasn't it already been 2025 for a while? And did it happen before 2025? Why is it being resolved now?

2mo

@Guilhermesampaiodeoliveir I was late to resolve this, sorry. I'm not sure what the correctness concern would be though, it resolved NO...

2mo

@IsaacKing oh god, I bet YES. mfw I made the mistake.

2mo

A more challenging market for this year:

2mo

It’s a few months late but 4o’s image generation got it right for me first try.

2mo

@bence plus an alternative description

2mo

Oh hey, ChatGPT can do octagons too!

2mo

@IsaacKing yeah, shame it didn't make it on time in 2024. It's not a diffusion model, but the market didn't specify the strategy it has to use for image generation.

2mo

Grok 3 actually can consistently draw a pentagon, but only if I use the word "pentagon". It fails as soon as the shape is described differently, like calling it a 5-sided shape. That's far better than DALL-E or any other image model I've seen, which can't even do it with the word "pentagon".

2mo

(Grok 3 only came out in February of this year, so even if the alternative description requirement weren't there this still would have resolved NO.)

5mo

As far as I'm aware this should resolve to NO, though it looks like some models are getting close. Does anyone want to propose another image model that I should test before resolving this?

5mo

@IsaacKing I don't think it's good enough, but https://www.recraft.ai is the closest I've found. I tried "pentagon" with different source image material and got pentagonal results about 60% of the time. Mostly home plate shaped (three right angles, two 135 degree angles), a few regular.

2mo

@IsaacKing Time to resolve, any new results at this point aren't "before 2025"

5mo

ChatGPT-4o appears to do this, but in fact it is writing and evaluating Python code to do it. That is IMO an extremely good solution to the problem, but it does not meet the terms of this question.

sold Ṁ3 YES5mo

Doesn't really work without the word "red" though

5mo

@AndrewMcKnight So if you include other colors, it fails?

5mo

@IsaacKing I only tried with and without the word "red". Not other colors

6mo

Haven't tested this thoroughly, but I think the new Aurora model in Grok knows the 2D shape.

5mo

@TimofeyValov Promising! Assuming these are representative, it looks like it knows the word "pentagon", but can't handle a description of a 5-sided shape, which is not sufficient to resolve this to YES. Getting there though!

5mo

@IsaacKing seems like Aurora does not understand '5-gon' and really wants it to be 3D

edit: oops, lots of people already tried this below.

Did anyone try a pentagonal cage fight? It's very hard to get it to do anything but 8 sides, but sometimes I can get something slightly different.

Asking for a square cage gave me either 4 or 6 sides, it's hard to say:

And I'm not sure if this is a 2 on 1 fight or a free for all.
