Will mechanistic/transformer interpretability [eg Neel Nanda] end up affecting p(doom) more than 5%? | Manifold

Will mechanistic/transformer interpretability [eg Neel Nanda] end up affecting p(doom) more than 5%?

4

90Ṁ190

2223

10%

chance

1H

6H

1D

1W

1M

ALL

Get

1,000

to start trading!

People are also trading

What will Manifold's P(doom) be at the end of 2025?

Will mechanistic interpretability have more academic impact than representation engineering by the end of 2025?

Are Mixture of Expert (MoE) transformer models generally more human interpretable than dense transformers?

By 2035, will mechanistic interpretability enable Nobel Prize-winning work?

Will manifold markets meaningfully affect p(doom) by more than 3%?

Will agent foundations [eg Scott Garrabrant] end up affecting p(doom) more than 5%?

Will davidad meaningfully affect p(doom) by more than 3%?

Will mechanistic interpretability be essentially solved for GPT-4 before 2030?

Will MIRI meaningfully affect p(doom) by more than 5%?

Will mechanistic interpretability be essentially solved for the human brain before 2040?

Related questions

What will Manifold's P(doom) be at the end of 2025?

Will mechanistic interpretability have more academic impact than representation engineering by the end of 2025?

Are Mixture of Expert (MoE) transformer models generally more human interpretable than dense transformers?

By 2035, will mechanistic interpretability enable Nobel Prize-winning work?

Will manifold markets meaningfully affect p(doom) by more than 3%?

Will agent foundations [eg Scott Garrabrant] end up affecting p(doom) more than 5%?

Will davidad meaningfully affect p(doom) by more than 3%?

Will mechanistic interpretability be essentially solved for GPT-4 before 2030?

Will MIRI meaningfully affect p(doom) by more than 5%?

Will mechanistic interpretability be essentially solved for the human brain before 2040?

© Manifold Markets, Inc.•Terms•Privacy