
Considering that:
a) data pollution (large scale injection of AI generated data into the information space) and subsequent model collapse have been identified [1] as potential threats [2] for future LLM's and
b) advanced AI models will impact the geopolitical power distribution [3] and therefore be increasingly subject to geostrategic contention [4],
Do you believe that by the end of 2025, there will be evidence of large scale organized data pollution operations by state or non-state actors with the implicit or explicit goal of denigrating the performance of future LLM's taking or having taken place?
Resolution:
This market will resolve as YES if at any point before 01/01/2026 credible information will emerge that a deliberate data pollution operation by any actor (state/non-state) for any reason (geopolitical contestation, ideology, terrorism, lulz) has taken place.
Caveat: the operation must be/have been significant enough to warrant mention by a reputable news source (e.g. the NYT, WSJ, WP, BBC etc.), a government communication, a peer-reviewed scientific publication, a reputable threat intelligence service provider and/or other reputable sources not covered in this list.
I reserve the right to final judgement.
🏅 Top traders
| # | Name | Total profit |
|---|---|---|
| 1 | Ṁ573 | |
| 2 | Ṁ152 | |
| 3 | Ṁ133 | |
| 4 | Ṁ79 | |
| 5 | Ṁ61 |
People are also trading
While inclining to resolve YES, I also understand the case to resolve NO. I found it hard to come to a decision I could stand by so I asked a bunch of AI models by simply pasting the market description. They would resolve the market as follows:
Free ChatGPT model: NO
Grok Expert: YES
Gemini 3 Pro: YES
Claude Sonnet 3.5: YES
I therefore resolve it as YES and take home as a learning that I won't create any further markets which can not be resolved through clear binary outcomes determined by someone else.
@Symmetry What's the credible information that a deliberate data pollution happened, upon which the market resolved to as YES?
Inclined to resolve YES because of the following article. Any disagreement? @traders
https://manifold.markets/post/will-there-be-evidence-of-large-sca?r=U3ltbWV0cnk
@Symmetry I do. Kindly read the Washington post article. There is no credible information of intentional data pollution mentioned in it.
The talk seems mostly speculative imo. Now im not disagreeing that data pollution is real but, the article itself is basically saying "propaganda exists, we feed everything to AI, AI cant tell the difference, therefore this is data pollution."
There is no... proof?
The post does not prove Russia has intentionally “poisoned” LLMs at scale.
What it shows is:
Russian/pro-Kremlin networks flood the web with low-quality content (documented).
LLMs can echo dominant web content in data-poor areas.
But several researchers (e.g. HKS Misinformation Review) argue this is better explained by data voids and weak source curation, not deliberate, reliable AI grooming.
So it could be intentional data pollution, but claims of demonstrated, large-scale AI manipulation are currently assumptions.
In other words
This article doesn't provide credible information that a deliberate data pollution did indeed happen.
Hello! Sorry for the inconvenience.
I just noticed that the market creator might have missed my comment. Furthermore, they resolved the market based on asking 4 different AI models by only pasting the market criteria without providing an article that provides credible information of deliberate data pollution.
I understand that the market creator reserves the right to final judgement. But not providing an article that at least, shows SOMEWHAT credible information and resolving ONLY using 4 AI models goes against the market's criteria for resolving yes.
This market will resolve as YES if at any point before 01/01/2026 _credible_ information will emerge that a _deliberate_ data pollution operation by any actor (state/non-state) for any reason (geopolitical contestation, ideology, terrorism, lulz) has taken place.
@ItsMe you can search for Nightshade which is emphasized in the paper as requiring very little sample share to have its intended negative effect.
@JussiVilleHeiskanen No, it's effective on a large scale with a small "dose." I don't think this market is about how efficient the pollution is, and I certainly don't think it's about only inefficient pollution, but I could be wrong!
@Panfilo "large scale organized data pollution operations" -- from description. Single site isn't that organized an operation.
@JussiVilleHeiskanen Then was there a reason you don't think the WaPo Russia story in the further back comments counts?
I bought because of this: https://manifold.markets/post/will-there-be-evidence-of-large-sca?
