What will the Anthropic SAE paper contain?

Ṁ250Ṁ106

resolved May 22

Resolved

YES

Eye-test experiments

Resolved

YES

Some cherry-picked proof of concept for a useful *type* of task

Resolved

YES

Streetlight edits

Resolved

Doing PEFT by training sparse weights and biases for SAE embeddings in a way that beats baselines like LORA

Resolved

Passive scoping

Resolved

Finding and manually fixing a harmful behavior that WAS represented in the SAE training data

Resolved

Using an SAE as a zero-shot anomaly detector

Resolved

Latent adversarial training under perturbations to an SAE's embeddings

Resolved

Experiments to do arbitrary manual model edits

Resolved

Finding and manually fixing a novel bug in the model that WASN'T represented in the SAE training data

This will resolve according to Stephen Casper's judgments.

Market context

Anthropic

Mechanistic interpretability

Get

1,000

to start trading!

🏅 Top traders

#	Trader	Total profit
1		Ṁ26
2		Ṁ12
3		Ṁ3

Sort by:

Casper's judgements are out:

Thus, my assessment is that Anthropic did 1-3 but not 4-10.

That is, YES eye-test experiments, YES streetlight edits, YES cherry-picked proof of concept, NO everything else.

People are also trading

When will Anthropic IPO?

12/31/26

What will happen between Anthropic and the Pentagon?

Will Anthropic open-source the training code of their SAE interpretability effort?

What will be the next major event for Anthropic?

Who will join Anthropic by end of 2026?

Will there be a publicly available AI with Anthropic Mythos-level capabilities before 2027?

96% chance

What will Anthropic's initial share price be? (split-anchored Feb 27 2026)

547

Will Anthropic, before 2035, completely halt development of AI and attempt to persuade other organizations to do so?

18% chance

What will be Anthropic first major acquisition?

Anthropic make bigger revisions to RSP by EOY 2028?

76% chance

🏅 Top traders

People are also trading

Related questions