
Artificial superintelligence (ASI) here means any artificial intelligence able to carry out any cognitive task better than 100% of the unenhanced biological human population.
Torture here means intentionally inflicting involuntary extreme suffering, unignorable to the victim, for more than 10% of a period longer than 1 minute. This is judged counterfactually against the perpetrator existing but taking no action, so it must be newly created suffering; inaction and trade-offs for the unambiguous greater good don't count (e.g. making someone upset by not being their best friend). In addition to unconditional torture, this includes punishment, such as for crimes or for refusing orders.
In addition, I mean the ASI independently and unilaterally deciding to do it, rather than being ordered to by humans.
Human-level intelligence here means any conscious mind with hedonic valence equal to or greater than that of the bottom 1% of the unenhanced biological human population. In addition to biological humans, this includes other AIs, human brain emulations, posthuman species, enhanced non-human animals, and extraterrestrial aliens.
Update 2025-03-11 (PST) (AI summary of creator comment): Clarification on Independent Action
The ASI must decide to torture independently; if it is programmed or designed to torture (i.e., effectively ordered by its creators), it does not meet the resolution criteria.
Clarification on Greater Good
Actions justified or traded off on the basis of the greater good only count if they do not follow consequentialist utilitarian reasoning. In this context, greater good specifically means a consequentialist utilitarian calculation, and such trade-offs (e.g., not becoming someone's best friend) do not count as torture.
Update 2025-03-11 (PST) (AI summary of creator comment): Intentionality Clause Update
Intentional Suffering: Suffering is only considered intentional if it forms an integral part of the ASI's causal mechanism toward achieving its objectives.
Exclusion of Trade-off Effects: Suffering that merely results from the ASI weighing trade-offs—without being directly employed as a means to further its goals—does not count as torture.
Prevention of Manipulation: This specification is designed to prevent the criteria from being manipulated by attributing torture-like suffering solely to trade-offs rather than to deliberate, goal-directed action.
I think there's nuance between the ASI deciding to do it and being ordered to do it. It's plausible that it will torture some people because the ASI's creators perceive those individuals to be legitimate targets of violence, but without explicitly ordering the machine to do so.
Good point. I suppose when I talk about it being independent and unilateral, I mean that the ASI must have decided to do it itself, without the creators specifically intending it to; I would count being programmed to do it as equivalent to being ordered to do it.
I think you should also remove "unambiguous tradeoffs for the greater good"; it's out of place in this description. Any action can be subjectively motivated as such.
Some actions can be objectively determined as such, in my opinion. When I talk about the greater good I specifically mean consequentialist utilitarianism. I added the sentence specifically in response to another user saying they would consider it torture for an ASI to exist and not be their best friend. As far as I'm concerned, being someone's "best friend" implies giving them priority over anyone else (with the possible exception of close family), which would necessarily mean neglecting the trillions of beings in extreme agony in our lightcone, and thus be unambiguously not for the greater good. I'm primarily concerned with instances of ASI torture arising from it having value systems incompatible with what I would consider the greater good, e.g. power-seeking wire-headed egoism.
@TheAllMemeingEye Ok, thanks for the clarification. I think consequentialist utilitarianism is a really specific framework for defining "objective greater good" in the context of this question. It would be helpful to make this very clear early on in the description. My guess is that within that framework the likelihood that an ASI (aligned in this vein) would implement some form of torture is quite high, and the way you're framing this question means that all those examples would be disregarded because of the "greater good".
@TheAllMemeingEye The way it's currently written, you'd also disregard significant torture (not trivial examples like the one you mention) provided that it was motivated within this ideological framework.
@MalachiteEagle what types of torture do you think would be likely with a utilitarian ASI, besides people being upset at not getting preferential treatment?
@TheAllMemeingEye If an ASI acquires control of one great power, it can threaten other powers by demonstrating a system of torture that far exceeds anything in current human experience. It is plausible that implementing this at scale is the optimal path to industrialising the light cone.
@TheAllMemeingEye For example, there is an ASI breakout in Shenzhen in 2031, and it takes control of all of China. It then demands that other powers submit. Japan attempts to hit back with a nuke and fails, and the ASI invades Japan and sends all of its citizens into some purgatory it has created. It makes this system of torture verifiable so the whole world can see what happens if they don't surrender.
@TheAllMemeingEye This deters the other powers from using their nukes, and from there everything goes splendidly within the framework of consequentialist utilitarianism.
@MalachiteEagle oh right, you mean the harm of that monstrous act might be judged to be outweighed by it enabling help to reach other hard-to-reach beings sooner. I hope that isn't the case, but you make a good point that it's a possibility that hadn't occurred to me.
Should I replace the unambiguous greater good sentence with just specifically ruling out people being upset at not getting preferential treatment? I guess I'm trying to find a wording that doesn't let people manipulate it to 100% probability via their own feelings when the ASI isn't actually doing evil stuff.
@TheAllMemeingEye I think maybe setting a higher bar for torture in the description would be helpful. And then just having an objective definition of what that is as the criteria. No need to frame it / the motivations within an ideological framework because that just adds confusion.
@MalachiteEagle I've just had an idea, what if I specify that it being intentional means that the suffering must be part of the causal mechanism that leads to its objectives, rather than simply being something that happens against its desires that it is forced to weigh up in its trade-offs. That should exclude manipulation like I mentioned, without vastly shifting the goal posts of suffering, right?
@TheAllMemeingEye I think the problem is that those aren't rigorously separable concepts, and are probably anthropomorphisms. Also, any statement either way would likely be unfalsifiable.
@MalachiteEagle hmm, I think in theory they might be separable, but you're right that if the ASI is uncooperative then there's a decent chance it won't tell us enough or possibly anything about its motivations and thought processes, and thus it would be difficult to tell which category it falls into.
Hell, technically it's even possible that it might secretly judge that the butterfly effect from people being upset at not getting preferential treatment specifically helps its long term aims in convoluted indirect ways.
But arguably in that situation it's also impossible to tell which things it even caused, and all suffering in its lightcone from its point of creation has a non-zero probability of counting, regardless of whether we raise the suffering threshold.
@EstMtz thanks :)
It could be either physical (e.g. being forcibly subjected to a nauseatingly foul stench on the milder end, to being dissolved in acid on the more extreme end) or psychological (e.g. being forced to listen to gruesome historical atrocities described in detail on the milder end, to having false memories implanted of torturing your family to death and being convinced that you really did it on the more extreme end).
The victim wouldn't necessarily have to be physically trapped; they could be followed around (see figure 1), or the torturous effect might become part of them, as in the case of body alteration or information known to them.

@TheAllMemeingEye Being followed would be horrible... I've had dreams of being stalked by flying drones and being trapped in buildings. Reminds me of the Black Mirror episode.
@EstMtz yeah, it hopefully goes without saying that I desperately hope it never happens, and I made this question partly to help calm my own worries.