Ambitious mechinterp is quite unlikely to confidently assess if AIs are deceptively aligned/dangerous in next 10 years
17
Never closes
Yes
No
Results

full question is "Ambitious mechanistic interpretability is quite unlikely[1] to be able to confidently assess[2] whether AIs[3] are deceptively aligned (or otherwise have dangerous propensities) in the next 10 years.", from https://www.lesswrong.com/posts/hc9nMipTXy2sm3tJb/vote-on-interesting-disagreements

Get
Ṁ1,000
to start trading!
© Manifold Markets, Inc.TermsPrivacy