For typical SOTA AI systems in 2028, will it be possible for users to know the true reasons a system made a choice?
Never closes
Very Unlikely (<10%)
Unlikely (10-40%)
Even odds (40-60%)
Likely (60-90%)
Very likely (>90%)

For typical state-of-the-art AI systems in 2028, do you think it will be possible for users to know the true reasons a system made a particular choice?

By “true reasons” we mean that the AI correctly explains its internal decision-making process in a way humans can understand, not that the decision itself is correct.


I would be concerned if we were relying on the AI's explanation of its reasons without being able to verify that explanation. To me, knowing the true reasons implies having interpretability techniques that can reliably decode the meaning of a change in neural net weights. If we are relying on an AI to tell us the reasons for a particular output, then we have built an AI with good self-understanding, but we do not understand that AI very well. That seems hard to do, and also bad if we manage to do it.
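
To make the "verify rather than trust" point concrete, here is a minimal Python/NumPy sketch of what checking a model's self-explanation against an independent interpretability signal could look like. Everything in it is an assumption for illustration: a toy logistic model with hand-picked weights, input-times-gradient attribution as the independent signal, and a made-up `claimed_top_feature` standing in for the model's self-report. Real SOTA systems would of course need far more sophisticated (and more trustworthy) techniques.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def input_x_gradient(w, b, x):
    """Input-times-gradient attribution for a logistic model p = sigmoid(w @ x + b).

    dp/dx_i = p * (1 - p) * w_i, so feature i gets attribution x_i * p * (1 - p) * w_i.
    """
    p = sigmoid(w @ x + b)
    return x * p * (1.0 - p) * w

# Toy model and input, chosen by hand purely for illustration.
w = np.array([2.0, -0.5, 0.1])
b = -0.3
x = np.array([1.0, 3.0, 0.2])

attributions = input_x_gradient(w, b, x)

# Suppose the system *claims* feature 1 drove its decision. Rather than
# trusting that self-report, compare it with the attribution ranking.
claimed_top_feature = 1
actual_top_feature = int(np.argmax(np.abs(attributions)))

print("attributions:", attributions)
print("claim matches attributions:", claimed_top_feature == actual_top_feature)
```

The point of the sketch is only that a stated reason becomes evidence once some independent method can agree or disagree with it; if the only source of the "reason" is the model itself, there is nothing to check it against.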
