Dalle-3 via openai's website can directly annotate images by mid 2024
5
66
130
Jul 1
4%
chance
  1. submit image, say of a cat

  2. say something like "can you annotate this image with your notes on what parts of the cat you see? put 5 of them in bold black text"

  3. dalle3 doesn't just "reproduce" an image using image => text => dalle3 generation again, (which looks very little like the original)

  4. instead, it draws on top of the original image

  5. so the original is basically present, but with modifications overlaid

  6. it has to be through the normal UI

Interestingly, you can get it to output json blocks explaining what it sees in each region and then use python to look at that. But it's kind of messy and what it sees doesn't seem to be captured by that type of output well. I wonder if you can get the full embedding?

Today: fail

Get Ṁ200 play money