If LMs store info as features in superposition, does # features scale superlinearly with number of model parameters?
© Manifold Markets, Inc.TermsPrivacy