Do you think Mixture of Experts (MoE) transformer models are generally more human interpretable than dense transformers?
Never closes
Options: Yes / No
This question is managed and resolved by Manifold.
Related questions
Are Mixture of Experts (MoE) transformer models generally more human interpretable than dense transformers?
50% chance
Will any open-source transformer LLM that functions as a dense mixture of experts be released by the end of 2024?
50% chance
Will the most capable, public multimodal model at the end of 2027 in my judgement use a transformer-like architecture?
55% chance
Will Transformer based architectures still be SOTA for language modelling by 2026?
68% chance
By EOY 2025, will the model with the lowest perplexity on Common Crawl not be based on transformers?
27% chance
Will mechanistic/transformer interpretability [e.g. Neel Nanda] end up affecting p(doom) by more than 5%?
36% chance
Is gpt-3.5-turbo a Mixture of Experts (MoE)?
84% chance
Will Mamba be the de-facto paradigm for LLMs over transformers by 2025?
7% chance