Will any open-source or open-weights Transformer-based LLM emerge that is functionally a dense version of a mixture-of-experts model, i.e. one whose empirical mathematical sparsity resembles that of dense models like Llama 3.1 405B or Mistral Large Enough? A tool that allows the creation of this type of model would resolve YES even if no model is released along with it, as long as the tool makes it possible to create such a model (for example, Mergekit for various forms of model manipulation). A paper alone would only resolve YES if there was an accompanying model, functional released code, or an implementation by a third party.
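To make the distinction concrete, here is a minimal toy sketch (my own illustration, not from any released model) of standard top-k MoE routing versus a "dense MoE" variant where every expert is active for every input. The expert matrices, router weights, and the `moe_forward` helper are all hypothetical names invented for this example; real implementations use learned parameters and per-token batched routing.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: 4 "experts", each a small linear map, plus a router projection.
n_experts, d = 4, 8
experts = [rng.normal(size=(d, d)) for _ in range(n_experts)]
router_w = rng.normal(size=(d, n_experts))

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def moe_forward(x, top_k):
    """Combine expert outputs weighted by (renormalized) router scores.

    top_k < n_experts  -> sparse MoE: only top_k experts run per token
                          (the Mixtral 8x7B style referenced in the comments)
    top_k == n_experts -> "dense MoE": every expert is active, which is the
                          kind of model this question asks about
    """
    gates = softmax(x @ router_w)
    keep = np.argsort(gates)[-top_k:]     # indices of the top_k gate values
    total = gates[keep].sum()
    out = np.zeros(d)
    for i in keep:
        out += (gates[i] / total) * (experts[i] @ x)
    return out, keep

x = rng.normal(size=d)
sparse_out, sparse_active = moe_forward(x, top_k=2)
dense_out, dense_active = moe_forward(x, top_k=n_experts)

print(f"sparse MoE ran experts {sorted(sparse_active)} (2 of {n_experts})")
print(f"dense variant ran experts {sorted(dense_active)} (all {n_experts})")
```

The point of the sketch: in the dense variant the router still produces a weighted mixture, but no expert is ever skipped, so the activated-parameter count matches a dense model of the same total size, which is the "empirical sparsity resembles dense models" criterion in the question.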
@Kearm20 This hasn't happened so far, right? Mixtral 8x7B and co don't count? They aren't "dense"?
@Bayesian It is technically "possible", but it looks like research has moved on to inference-time compute and mixture of agents.
@Bayesian No. There was some promising early work that I followed, but it has seemingly fallen out of favor for Transformer LLMs as Qwen's QwQ and other work has taken the spotlight. If a lab drops a dense MoE before the end of the year, though, it would resolve YES, and we still have two weeks.