If LMs store info as features in superposition, are there >300K features in GPT-2 small L7? (see desc)
59% chance

If, in 2040, I am more than 80% confident that LMs mainly store information in their residual stream as something like sparse linear features, and I am also more than 80% confident in a particular approximate number of features in the residual stream before layer 7, this market resolves Yes if that number is greater than 300K and No if it is less than 300K. Otherwise, it resolves N/A.
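
The resolution rule above can be read as a small decision procedure. The following is only an illustrative sketch, not an official resolution mechanism: the function name `resolve` and its parameters are hypothetical, and it assumes that an estimate of exactly 300K, or a missing estimate, falls under "Otherwise" and resolves N/A.

```python
from typing import Optional

THRESHOLD = 300_000      # feature-count cutoff from the market title
MIN_CONFIDENCE = 0.80    # both confidence conditions must exceed this


def resolve(
    conf_sparse_linear: float,          # hypothetical: confidence in the sparse-linear-features picture
    conf_in_estimate: float,            # hypothetical: confidence in one approximate feature count
    estimated_features: Optional[int],  # hypothetical: that approximate count, if any
) -> str:
    """Return "Yes", "No", or "N/A" under the stated reading of the description."""
    # Either confidence condition failing, or no estimate at all, means N/A.
    if conf_sparse_linear <= MIN_CONFIDENCE or conf_in_estimate <= MIN_CONFIDENCE:
        return "N/A"
    if estimated_features is None:
        return "N/A"
    if estimated_features > THRESHOLD:
        return "Yes"
    if estimated_features < THRESHOLD:
        return "No"
    # Assumption: an estimate of exactly 300K is covered by "Otherwise".
    return "N/A"
```

For example, `resolve(0.9, 0.85, 2_000_000)` returns `"Yes"`, while `resolve(0.9, 0.5, 2_000_000)` returns `"N/A"` because the confidence in the estimate is too low.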

I won't bet in this market.
