Will Anthropic's April SAE Training Updates stack with Gated SAEs?
2
17
Ṁ63Ṁ120
Dec 31
64%
chance
1D
1W
1M
ALL
If training Gated SAEs [1] with Anthropic's new training techniques [2] gives equal loss recovered with <90% of the L0 value of either of the techniques individually applied, at the majority of sites in Pythia-2.8B / Gemma-7B / Mistral-7B (whichever actually gets benchmarked), this resolves yes.
Resolves when information is publicly available about the outcome.
I haven't implemented [2] yet so have no insider information, and also I will not trade in this market besides an initial bet.
[1]: https://arxiv.org/abs/2404.16014
[2]: https://transformer-circuits.pub/2024/april-update/index.html#training-saes
Get Ṁ600 play money
More related questions
Related questions
Are Gated SAEs better than Anthropic's training updates?
63% chance
Do Anthropic's training updates make SAE features as interpretable?
50% chance
What will the Anthropic SAE paper contain?
Will Anthropic, before 2035, pause development for at least a year as a result of safety evaluations?
29% chance
Will Anthropic's RSP security commitments (as of Oct. 28 2023) cause them to pause scaling for at least one month?
27% chance
Will Anthropic, before 2035, pause development for at least six months as a result of safety evaluations?
44% chance
SoAI 23 3/10: Will Self-improving Al agents crush SOTA in a complex environment (e.g. AAA game, tool use, science)?
29% chance
Will Anthropic release an image generation system by mid 2024?
32% chance
Will Anthropic release an image generator in 2024?
52% chance
Will Anthropic raise an up/flat/down/no round Series C by the end of 2026?