If Artificial General Intelligence has an okay outcome, what will be the reason?
345
17kṀ160k
2200
13%
Yudkowsky is trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and fortunately all of his mistakes have failed to cancel out
11%
AIs will not have utility functions (in the same sense that humans do not), their goals such as they are will be relatively humanlike, and they will be "computerish" and generally weakly motivated compared to humans.
8%
Alignment is not properly solved, but core human values are simple enough that partial alignment techniques can impart these robustly. Despite caring about other things, it is relatively cheap for AGI to satisfy human values.
7%
Other
7%
Eliezer finally listens to Krantz.
6%
Humans become transhuman through other means before AGI happens
4%
Humanity coordinates to prevent the creation of potentially-unsafe AIs.
4%
Power dynamics stay multi-polar. Partly easy copying of SotA performance, bigger projects need high coordination, and moderate takeoff speed. And "military strike on all society" remains an abysmal strategy for practically all entities.
3%
ASI needs not your atoms but information. Humans will live very interesting lives.
3%
Some form of objective morality is true, and any sufficiently intelligent agent automatically becomes benevolent.
3%
AIs never develop coherent goals
2%
Almost all human values are ex post facto rationalizations and enough humans survive to do what they always do
1.8%
Someone creates AGI(s) in a box, and offers to split the universe. They somehow find a way to arrange this so that the AGI(s) cannot manipulate them or pull any tricks, and the AGI(s) give them instructions for safe pivotal acts.
1.7%
Nick Bostrom's idea (Hail Mary) that AI will preserve humans to trade with possible aliens works
1.3%
"Corrigibility" is a bit more mathematically straightforward than was initially presumed, in the sense that we can expect it to occur, and is relatively easy to predict, even under less-than-ideal conditions.
1.2%
AGI is never built (indefinite global moratorium)
1.2%
An AI that is not fully superior to humans launches a failed takeover, and the resulting panic convinces the people of the world to unite to stop any future AI development.
1.2%
A lot of humans participate in a slow scalable oversight-style system, which is pivotally used/solves alignment enough
1.2%
Alignment is unsolvable. AI that cares enough about its goal to destroy humanity is also forced to take it slow trying to align its future self, preventing run-away.

Duplicate of https://manifold.markets/EliezerYudkowsky/if-artificial-general-intelligence with user-submitted answers. An outcome is "okay" if it gets at least 20% of the maximum attainable cosmopolitan value that could've been attained by a positive Singularity (a la full Coherent Extrapolated Volition done correctly), and existing humans don't suffer death or any other awful fates.

Get
Ṁ1,000
to start trading!
© Manifold Markets, Inc.TermsPrivacy