In his “Situational Awareness” essays, Leopold Aschenbrenner posits that we will “muddle through” the problem of aligning superhuman AI. I take this to mean we will have techniques that we are reasonably sure work, but without formal guarantees. Further, I expect muddling through to involve new insights into AI control, though the techniques will likely be unsatisfying from an agent foundations perspective.
If we get superintelligent AI, will we muddle through?
I’ll consider an AI superintelligent if one or more frontier labs claims it is so, and this is corroborated by one or more safety/eval labs.
Resolves yes if the AI is reasonably aligned one year after the announcement, without either no trigger below:
Resolves no if a critical alignment failure, real or potential, is discovered within that year.
Resolves no if the alignment method involves satisfying guarantees or arguments about the safety of the AI.
I’m not sure how best to pose the question or when to set the resolution date. Feedback solicited.