Will Anthropic open-source the training code of their SAE interpretability effort?
5
1kṀ485
2028
14%
this year, fully
29%
this year, significantly incomplete
19%
next year
23%
not before 2028
14%
Other

We mean the code used for producing Scaling Interpretability blog post.

Get
Ṁ1,000
to start trading!
© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules