Will we be able to read AIs' minds in 2030?
25% chance

Will Digital Neuroscience succeed in this decade? Resolves YES if, at the end of 2030, there is at least one AI model for which humans can read what it is thinking and understand how the model works, and whose performance is close to that of the state-of-the-art AGI.


How much do we need to understand? Would our current level of understanding of human minds be enough, or does it need to be more?

@MartinRandall No, that would not be sufficient; we cannot currently "read people's minds".

How well do we need to understand how the model works?

@NoaNabeshima That's hard to define, but our current level of understanding, roughly "this giant inscrutable matrix was trained over this data set with this loss function", isn't very useful to us. The primary reason I'm interested in seeing what a model is thinking is to determine whether it is deceiving us.

I need to read up on interpretability research, but if there were a breakthrough in the field such that we could identify the purposes of different sections of an LLM, and get a human-interpretable output from most sections of the model at any point (something along the lines of the sketch below), that would be sufficient to resolve yes.
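To make "reading out sections of the model" concrete, here is a minimal sketch of the mechanics involved, not an actual interpretability method: it only captures per-layer activations with PyTorch forward hooks on a made-up toy encoder standing in for an LLM. The hard, unsolved part this market asks about is mapping those activations to human-readable thoughts.

```python
# Minimal sketch: capturing per-layer activations with PyTorch forward hooks.
# The tiny encoder below is an illustrative stand-in for an LLM; real
# interpretability work would also need to translate these activations
# into human-readable concepts.
import torch
import torch.nn as nn

# A toy stand-in "model": two transformer encoder layers.
layer = nn.TransformerEncoderLayer(d_model=64, nhead=4, batch_first=True)
model = nn.TransformerEncoder(layer, num_layers=2)

captured = {}  # layer name -> activation tensor

def make_hook(name):
    def hook(module, inputs, output):
        captured[name] = output.detach()
    return hook

# Register a hook on each layer so we can inspect what it outputs.
for i, block in enumerate(model.layers):
    block.register_forward_hook(make_hook(f"layer_{i}"))

x = torch.randn(1, 10, 64)  # (batch, sequence, hidden) dummy input
model(x)

for name, act in captured.items():
    print(name, act.shape)  # e.g. layer_0 torch.Size([1, 10, 64])
```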

If model paradigms that provide better interpretability become the new state of the art in AI capabilities, I'm open to arguments about their capacity for deception. For example, AutoGPT came out after I made this market and seems like it could lead to a somewhat more promising future for interpretability. For AutoGPT-style models, which are still an inscrutable LLM at their core but use a human-interpretable scratchpad for their short-term memory, I suppose it would depend on how reliant they are on that scratchpad, and whether they are capable of keeping illicit thoughts out of the scratchpad in order to deceive us.
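For illustration, here is a rough sketch of what I mean by a scratchpad-style loop, under my own assumptions about how such agents are wired: the model's short-term memory is plain text that an overseer can read at every step, but only the thoughts the model chooses to write down are visible. `query_model` is a hypothetical placeholder, not a real API.

```python
# Sketch of an AutoGPT-style loop: short-term memory lives in a plain-text
# scratchpad that a human can read at every step.
# `query_model` is a hypothetical stand-in for a call to an underlying LLM.

def query_model(prompt: str) -> str:
    """Placeholder for an LLM call; returns a canned thought for illustration."""
    return "THOUGHT: break the task into sub-tasks\nACTION: list_subtasks"

def run_agent(task: str, max_steps: int = 3) -> list[str]:
    scratchpad: list[str] = []  # human-readable short-term memory
    for step in range(max_steps):
        prompt = f"Task: {task}\nScratchpad:\n" + "\n".join(scratchpad)
        response = query_model(prompt)
        scratchpad.append(f"[step {step}] {response}")
        # An overseer can inspect the scratchpad here, but sees only the
        # thoughts the model wrote down, not its internal activations.
    return scratchpad

if __name__ == "__main__":
    for entry in run_agent("summarize a document"):
        print(entry)
```

The deception question then becomes how much of the agent's actual reasoning has to pass through that scratchpad for it to act competently.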

I'm not in the field, but I'm not aware of any models that would meet these criteria today.