For typical state-of-the-art AI systems in 2028, do you think it will be possible for users to know the true reasons for systems making a particular choice?
By “true reasons” we mean the AI correctly explains its internal decision-making process in a way humans can understand. By “true reasons” we do not mean the decision itself is correct.
I would be concerned if we were relying on the AI's explanation of its reasons without being able to verify that explanation. To me, knowing the true reasons implies having interpretability techniques that can reliably decode the meaning of a change in neural net weights. If we are instead relying on the AI to tell us the reasons for a particular output, then we have built an AI that understands itself well while we ourselves do not understand it very well. That seems hard to do, and also bad if we manage to do it.
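For concreteness (this is not part of the original answer), here is a minimal sketch of one very simple interpretability technique, a linear probe, which tests whether a human-readable concept can be decoded from a network's internal representations rather than from the AI's own self-report. Everything here is hypothetical: the "activations", the planted concept direction, and the dimensions are made-up stand-ins, and real work would use activations captured from an actual model.

```python
# Toy sketch of a linear probe: can a concept be read off a model's internals?
# All data below is synthetic and purely illustrative.

import numpy as np

rng = np.random.default_rng(0)

# Pretend these are hidden-layer activations for 200 inputs (dimension 32),
# plus a binary concept label we suspect the model represents internally.
n, d = 200, 32
concept = rng.integers(0, 2, size=n)             # 0/1 concept labels
direction = rng.normal(size=d)                   # planted "concept direction"
hidden = rng.normal(size=(n, d)) + np.outer(concept, direction)

# Fit a logistic-regression probe on the activations by gradient descent.
w, b = np.zeros(d), 0.0
for _ in range(500):
    logits = hidden @ w + b
    p = 1.0 / (1.0 + np.exp(-logits))
    w -= 0.5 * (hidden.T @ (p - concept) / n)
    b -= 0.5 * np.mean(p - concept)

accuracy = np.mean((hidden @ w + b > 0) == concept)
print(f"probe accuracy: {accuracy:.2f}")
```

High probe accuracy would suggest the concept is linearly decodable from the internals; it is a far weaker standard than "knowing the true reasons" for a decision, which is part of why I am skeptical we will get there by 2028.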