If Artificial General Intelligence has an okay outcome, what will be the reason?

MANIFOLD

18%

Other

18%

AIs will not have utility functions (in the same sense that humans do not), their goals such as they are will be relatively humanlike, and they will be "computerish" and generally weakly motivated compared to humans.

12%

Humanity coordinates to prevent the creation of potentially-unsafe AIs.

Alignment is not properly solved, but core human values are simple enough that partial alignment techniques can impart these robustly. Despite caring about other things, it is relatively cheap for AGI to satisfy human values.

Eliezer finally listens to Krantz [resolves NO]

Yudkowsky is trying to solve the wrong problem using the wrong methods based on a wrong model of the world derived from poor thinking and fortunately all of his mistakes have failed to cancel out

Multipolar AGI Agents run wild on the internet, hacking/breaking everything, causing untold economic damage but aren't focused enough to manipulate humans to achieve embodiment. In the aftermath, humanity becomes way saner about alignment.

1.5%

Ethics turns out to be a precondition of superintelligence

1.5%

Hacks like RLHF-ing self-disempowerment into frontier models work long enough to develop better alignment methods, which in turn work long enough to ... etc; we keep ahead of 'alignment escape velocity'

1.5%

We create a truth economy. https://manifold.markets/Krantz/is-establishing-a-truth-economy-tha?r=S3JhbnR6

1.2%

Aligned AI is more economically valuable than unaligned AI. The size of this gap and the robustness of alignment techniques required to achieve it scale up with intelligence, so economics naturally encourages solving alignment.

High-level self-improvement (rewriting code) is an intrinsically risky process, so AIs will prefer low-level and slow self-improvement (learning), thus AIs collaborating with humans will have an advantage. Ends with a posthuman ecosystem.

A concerted effort targets an agent at a capability plateau which is adequate to defer the hard parts of the problem until later. The necessary near-term problems to solve didn't depend on deeply modeling human values.

AI control gets us helpful enough systems without being deadly

AGI is never built (indefinite global moratorium)

Duplicate of https://manifold.markets/EliezerYudkowsky/if-artificial-general-intelligence with user-submitted answers. An outcome is "okay" if it gets at least 20% of the maximum attainable cosmopolitan value that could've been attained by a positive Singularity (a la full Coherent Extrapolated Volition done correctly), and existing humans don't suffer death or any other awful fates.

I don't understand why people keep betting this up. We don't see AIs acting "computerish" and I don't see why they would begin to do so in the future.

@MaxE Agreed! And at the same time the answer wants them to be relatively human-like, and I can't see both being true…

@4fa They will both be true.

Their goals will be humanlike.

Their attitudes will be computerish (and they are now.)

@MaxE
>We don't see ais acting "computerish"

I think by 'computerish' they mean something like 'extremely malleable to instruction' which we do see with LLMs. If you change the system prompt of an LLM (or break out the big guns and fine-tune it) it is extremely easy to change its behavior.

The independent version is live, starting with 10 options: https://manifold.markets/4fa/independent-mc-version-if-artificia

I will add additional options based on the following polls:

https://manifold.markets/4fa/which-answers-should-be-kept-when-m

https://manifold.markets/4fa/google-form-in-description-which-an

What if AGI just replaces the existing world powers, just as is, and we do not feel any difference at all?

@1bets I mean, that counts as an okay outcome. The reason it just replaces existing powers will be what the market resolves to.

bought Ṁ150 YES

Voted based on my research, summarized here: https://blog.ideanexusventures.com/the-conscious-economy/

bought Ṁ50 NO

That seems like a lot of fancy neologisms to say “we should use AI to automate tedious things like paperwork”, and I don’t see what it has to do with the quantum mechanics aspect of the market you bet on.

@Kronopath how did I get in on this at 1.8% to 9% and now the orderbook looks like this? lol insane this was ever so low.

sold Ṁ407 NO

@EliezerYudkowsky I really think it should be more like 0.001% (10^-24%?) of the "maximum attainable cosmopolitan value that could've been attained by a positive Singularity (a la full Coherent Extrapolated Volition done correctly)".

bought Ṁ10 YES

>An outcome is "okay" if it gets at least 20% of the maximum attainable cosmopolitan value that could've been attained by a positive Singularity (a la full Coherent Extrapolated Volition done correctly), and existing humans don't suffer death or any other awful fates.

Tons of unimaginably amazing, extremely good futures don't qualify as "okay" by this definition, hmm.

What exactly is the plan to resolve the multiple non-contradictory resolution criteria? Will there be some kind of "weighted according to my gut feeling of how important they are"? Will they all resolve "yes"? Or is it "I will pick the one that was most centrally true"?

It would be nice if there was some kind of flow-chart for resolution like in my "if AI causes human extinction" market.

I've blocked Krantz, though I don't know whether that prevents him from creating new answers. I don't seem to have the ability to resolve the current answers N/A, and would hesitate to resolve "No" under the circumstances unless a mod okays that.

@EliezerYudkowsky

>I don't seem to have the ability to resolve the current answers N/A, and would hesitate to resolve "No" under the circumstances unless a mod okays that.

Unfortunately this is a dependent multiple choice market, so all options have to resolve (summing to 100% or N/A) at the same time. So it's not a question of whether that's ok with mods, it simply isn't possible given the market structure.

It's a not uncommon issue that popular dependent MC markets get many unwanted answers added. It would be great if there were better tools to control this, but unfortunately the options are pretty blunt. My personal recommendation (but totally up to you) would be to change the market settings so that only the creator can add answers---then, people can make suggestions in the comments, and you can choose whether to include them or not. (I can make that change to the settings if you'd prefer, but it's under the 3 dots for more market options).

You can also feel free to edit any unwanted answers to just say "N/A" or "Ignore" or etc, to partially clean up the market (& clarify where attention should go). That's very much within your right as creator. But there's no way to actually remove the options (or resolve them early, although they will quickly go to ~0% with natural betting).
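The constraint described in this reply (every answer in a dependent multiple-choice market resolves at the same time, with the weights summing to 100%, or the whole market resolves N/A) can be written down as a small check. The Python sketch below is illustrative only: `validate_resolution` and its arguments are hypothetical names, not part of Manifold's API, and it assumes a resolution is either the string "N/A" for the whole market or a percentage for every answer.

```python
import math
from typing import Dict, List, Union


def validate_resolution(
    answers: List[str],
    resolution: Union[str, Dict[str, float]],
) -> bool:
    """Return True if `resolution` is allowed for a dependent MC market.

    Hypothetical model, not Manifold's real API:
    - "N/A" cancels the whole market at once, or
    - every answer receives a non-negative percentage and the
      percentages sum to 100.
    """
    if resolution == "N/A":
        return True
    if not isinstance(resolution, dict):
        return False
    # No answer may be left out or resolved on its own.
    if set(resolution) != set(answers):
        return False
    if any(weight < 0 for weight in resolution.values()):
        return False
    return math.isclose(sum(resolution.values()), 100.0)


answers = ["Answer A", "Answer B", "Other"]

# Resolving a single unwanted answer early is rejected under this model.
print(validate_resolution(answers, {"Answer A": 0.0}))
# Resolving everything together, summing to 100%, is accepted.
print(validate_resolution(answers, {"Answer A": 60.0, "Answer B": 40.0, "Other": 0.0}))
```

Under this model, the only way to neutralize an unwanted answer before resolution is the one suggested above: edit its text and let betting push it toward 0%.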

@EliezerYudkowsky If it's not too much of a hassle, would you also consider making an unlinked version of this market with the most promising options copied over, so that the non mutually exclusive options don't distort each others' probabilities? I know I could do this myself if necessary but your influence brings vastly more attention to the market and this seems like a fairly important market question. Maybe the wording would need to be very slightly altered to "...what will be true of the reason?"

@EliezerYudkowsky Least hassle approach: Start with "Duplicate" in the menu…

…then "Choose question type"…

…choose "Set" instead…

…delete the answers you don't want to keep. (When I tested, the answers carried over.)

@EliezerYudkowsky An alternative to N/A-ing this entire market would be to unlist it:

…in response to @TheAllMemeingEye's concern that "[this market] makes the site look bad being promoted so high on the home page".

bought Ṁ10 NO

@4fa superb advice :) I didn't realise it was that easy lol

@EliezerYudkowsky I would recommend to just edit all of Krantz’s options to [Resolves No]

Bafflingly, @EliezerYudkowsky appears to be the (distant) second-biggest Yes holder on Krantz’s options. I’m not sure how that happened. (Some kind of auto-betting from betting on “Other” or something?)

@Kronopath When one holds YES shares in 'Other', one is awarded that number of YES shares in any subsequently added options.

@Kronopath In addition to what jim explained, you can also see that it says "Spent Ṁ0".
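The mechanism described in the two replies above can be sketched in a few lines of Python. This is a toy model, not Manifold's implementation: the `Position` class and `add_answer` function are hypothetical names, and the sketch assumes the holder's 'Other' shares stay in place when they are mirrored into the new answer.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Position:
    """One trader's YES shares per answer, plus mana spent (hypothetical model)."""
    yes_shares: Dict[str, float] = field(default_factory=dict)
    spent: float = 0.0


def add_answer(position: Position, new_answer: str) -> None:
    """Award YES shares in a newly added answer equal to the 'Other' holding.

    Assumption: the 'Other' shares themselves are left untouched.
    No mana is spent, which is why the position shows "Spent Ṁ0".
    """
    other_shares = position.yes_shares.get("Other", 0.0)
    if other_shares > 0:
        current = position.yes_shares.get(new_answer, 0.0)
        position.yes_shares[new_answer] = current + other_shares


# A trader who only ever bought YES on 'Other'.
holder = Position(yes_shares={"Other": 120.0}, spent=50.0)
add_answer(holder, "Some newly submitted answer")

print(holder.yes_shares["Some newly submitted answer"])  # same count as 'Other'
print(holder.spent)  # unchanged by the new answer
```

This is how a trader can end up as a large YES holder on an answer they never bet on.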

Despite being blocked, he's back again throwing mana at his own options, ffs. I am in favor of editing all of Krantz’s options to [Resolves No].

@Krantz This was too long to fit.

Enough people understand that we can control a decentralized GOFAI by using a decentralized constitution that is embedded into a free and open market that sovereign individuals can earn a living by aligning. Peace and sanity are achieved game-theoretically by making the decentralized process that interpretably advances alignment the same process we use to create new decentralized money. We create an economy that properly rewards the production of valuable alignment data and it feels a lot like a school that pays people to check each other's homework. It is a mechanism that empowers people to earn a living by doing alignment work decentrally in the public domain. This enables us to learn the second bitter lesson: "We needed to be collecting a particular class of data, specifically confidence and attention intervals for propositions (and logical connections of propositions) within a constitution."

If we radically accelerated the collection of this data by incentivizing its growth monetarily in a way that empowers poor people to become deeply educated, we might just survive this.