Will data poisoning cause problems for AI image generators in 2024?
17% chance

Nightshade, a tool for "poisoning" images and making them unusable for training AI models, has recently been released. Many artists have expressed interest in using tools like Nightshade to prevent their art from being used to train image generators.

This market will resolve to YES if any of the following happens by the end of 2024:

  • Widespread poisoned data causes noticeable degradation in the outputs of a major AI image generator.

  • Notable amounts of effort are devoted to filtering out or altering poisoned images in datasets. For example, it would count if dataset curators regularly have to do extra preprocessing on a large portion of images in a dataset to avoid data poisoning.

  • AI companies make some form of concession to artists who poison their images, such as no longer training on their work or paying them royalties.

I won't bet on this market.

opened a Ṁ3,000 NO at 25% order

Take my limit order?

bought Ṁ10 of YES

I see certain platforms like pixiv or Pinterest provide this service. I don't think that will silently seep into image models, though; people will say "don't use my work" and those works will not be used.

bought Ṁ0 of NO

Non-trivial amounts of effort must be devoted to filtering out or altering poisoned images in datasets

I think this should be edited a bit; 'non-trivial' could still refer to a very small amount of effort.

bought Ṁ25 of NO

Poisoning data works for preventing the tracking of a single individual (e.g. poisoning their own social media timeline), but unless a large number of artists start doing this (which won't happen), it's not going to affect AIs, even with a naive training process that just scrapes everything off the internet.

On the off chance enough people started doing this that it became a problem, computer scientists could fall back on one of several strategies:

1: use a simple heuristic to sort poisoned images from non-poisoned ones (e.g. the image must have got at least 5 upvotes on Reddit; see the sketch after this list)

2: train a classifier to identify legitimate images and block poisoned ones

3: literally just amplify the existing models

4: outsource the identification and removal of poisoned images to data farms

5: use human-in-the-loop learning to train a reward model which can be used as part of an adversarial training process
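
As a rough illustration of strategy 1, the heuristic could be as simple as a one-line filter over scraped metadata. This is a hypothetical Python sketch: the record fields and the threshold are assumptions, not anything a real pipeline is known to use.

```python
# Hypothetical sketch of strategy 1: keep only images whose scraped metadata
# clears a crude engagement threshold (field names here are made up).

def filter_by_heuristic(records, min_upvotes=5):
    """Return only the records that pass a simple popularity check.

    `records` is assumed to be a list of dicts like
    {"url": "...", "upvotes": 12}; anything below the threshold is
    treated as too risky to include in the training set.
    """
    return [r for r in records if r.get("upvotes", 0) >= min_upvotes]


if __name__ == "__main__":
    scraped = [
        {"url": "https://example.com/a.png", "upvotes": 12},
        {"url": "https://example.com/b.png", "upvotes": 1},
    ]
    print(filter_by_heuristic(scraped))  # keeps only a.png
```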

bought Ṁ2 of YES

@DaisyWelham I wouldn’t be surprised to find a classifier on huggingface already, or within a week or so at most.

predicts NO

@mariopasquato Yeah, I mean the code is identical to the discriminator part of a GAN, so they literally just need to train the discriminator on a dataset that includes poisoned images and then use it as the filter for a new dataset. At most this is a mild inconvenience.
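
A rough sketch of that discriminator-as-filter idea, assuming someone has a labelled set of clean and poisoned examples to train on. Everything here (the small CNN, 64x64 inputs, the 0.5 threshold) is illustrative, not a tested defence against Nightshade specifically.

```python
# Minimal sketch: a GAN-discriminator-style CNN trained as a binary
# clean-vs-poisoned classifier, then reused to filter candidate training data.
# Assumes 64x64 RGB inputs; architecture and threshold are illustrative.

import torch
import torch.nn as nn


class Discriminator(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),   # 64 -> 32
            nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),  # 32 -> 16
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.LeakyReLU(0.2), # 16 -> 8
            nn.Flatten(),
            nn.Linear(128 * 8 * 8, 1),  # single logit: higher = "looks poisoned"
        )

    def forward(self, x):
        return self.net(x)


def train_filter(model, loader, epochs=3, lr=1e-4):
    """Train on (image, label) batches where label 1 = poisoned, 0 = clean."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.BCEWithLogitsLoss()
    for _ in range(epochs):
        for images, labels in loader:
            opt.zero_grad()
            logits = model(images).squeeze(1)
            loss = loss_fn(logits, labels.float())
            loss.backward()
            opt.step()
    return model


@torch.no_grad()
def keep_clean(model, images, threshold=0.5):
    """Return a boolean mask selecting the images the filter judges clean."""
    probs = torch.sigmoid(model(images)).squeeze(1)
    return probs < threshold
```

The trained model would then be applied as `keep_clean(model, batch)` to mask out suspect images before they enter a new training set.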
