Will a new deep learning paradigm replace the transformer by the end of 2024?
18% chance

Will a new neural architecture, or an entirely different machine learning method, replace the query-, key-, and value-based attention architectures currently dominant in large language models? Or will large language models (and, more generally, foundation models across modalities) continue to scale up transformers? Fundamentally, the new method must not employ layers of self-attention or cross-attention, and it must show scaling laws more promising than those of transformer-based LLMs. It must be commonly recognized by practitioners as superior to transformer methods, and multiple state-of-the-art open- and closed-source models must employ it. From the invention of the transformer, it took a few years for it to become universally adopted. However, with the current attention on foundation models, adoption of a better approach should be significantly swifter.
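For concreteness, the query-key-value attention mechanism that this question treats as the defining feature of transformers can be sketched in a few lines. This is a minimal NumPy illustration of scaled dot-product attention, not any particular model's implementation:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Compute softmax(Q K^T / sqrt(d_k)) V for 2-D query/key/value matrices."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                      # (n_q, n_k) similarity scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)       # row-wise softmax
    return weights @ V                                   # weighted mix of value vectors

# Toy example: 3 query tokens attending over 4 key/value tokens, d_k = 8.
rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 8))
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 8))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (3, 8)
```

A candidate architecture would resolve this market YES only if it replaces this attention block entirely (as state-space models like Mamba aim to do), rather than approximating or accelerating it.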

bought Ṁ40 of YES

Personally I have a bit of faith in this concept:

https://arxiv.org/abs/2312.00752

predicts NO

@Ebcc1 Definitely on my radar!

predicts YES

@Supermaxman What do you think about it and other possibilities?

predicts NO

@Ebcc1 Watching to see how peers receive it at ICLR: https://openreview.net/forum?id=AL1fq05o7H
