Will a lab train a >=1e26 FLOP state space model before the end of 2025? | Manifold

35

Ṁ1kṀ3.5k

Jan 1

9% chance

The Transformer architecture, introduced by Vaswani et al. in 2017, has been a cornerstone of deep learning, particularly for natural language processing and computer vision. However, the computational inefficiency of Transformers, especially on long sequences, has been a growing concern.

Recently, Gu and Dao introduced a new architecture named Mamba, which is built on selective state space models (SSMs). It addresses some of the key inefficiencies of Transformers by selectively propagating or forgetting information along a sequence based on the current input token. Mamba shows promising results, achieving state-of-the-art performance in several domains, including language, audio, and genomics, and sometimes outperforming Transformers of similar or larger size.
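To make "selective propagation or forgetting" concrete, here is a minimal scalar sketch in Python. It is not Mamba's actual implementation: the function name `selective_states`, the parameters `a`, `w_delta`, and `b`, and the choice to make only the step size depend on the input are all simplifying assumptions for illustration.

```python
import math


def softplus(z):
    # Numerically stable softplus: log(1 + e^z), always positive.
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def selective_states(xs, a=1.0, w_delta=1.0, b=1.0):
    """Return the hidden state after each token of a scalar selective recurrence.

    Hypothetical toy model (assumption, not the published architecture):
      delta_t = softplus(w_delta * x_t)      input-dependent step size
      decay_t = exp(-delta_t * a)            near 1 keeps the state, near 0 forgets it
      h_t     = decay_t * h_{t-1} + delta_t * b * x_t
    """
    h = 0.0
    states = []
    for x in xs:
        delta = softplus(w_delta * x)
        decay = math.exp(-delta * a)
        h = decay * h + delta * b * x
        states.append(h)
    return states


# A strongly negative token gives delta close to 0, so the state passes through almost unchanged.
kept = selective_states([5.0, -20.0])
# A strongly positive token gives a large delta, so the earlier state is almost entirely forgotten.
reset_a = selective_states([5.0, 20.0])
reset_b = selective_states([-3.0, 20.0])
```

In this toy version, `kept[1]` stays very close to `kept[0]`, while `reset_a[1]` and `reset_b[1]` nearly coincide even though their histories differ. That is the sense in which the current token decides whether earlier information is carried forward or dropped.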

This question will resolve positively if, by Jan 1, 2026, credible evidence exists that a state space model has been trained using 1.00E26 FLOP or more.

This criterion is chosen as it will likely take many millions of dollars to train a 1e26 FLOP model, even in 2025, and is therefore a useful proxy for 'frontier AI labs making large bets on SSMs'.
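As a rough check on the cost claim, the arithmetic can be sketched in Python. Every input below is an assumption chosen for illustration, not a figure from this market: the per-accelerator peak throughput, the utilization fraction, and the hourly price are hypothetical round numbers, and `training_cost_usd` is a made-up helper name.

```python
def training_cost_usd(total_flop, peak_flop_per_s, utilization, usd_per_gpu_hour):
    """Estimate rental cost of a training run from its total compute.

    All arguments are caller-supplied assumptions:
      total_flop        total training compute in FLOP
      peak_flop_per_s   peak throughput of one accelerator in FLOP/s
      utilization       fraction of peak actually achieved (0 to 1)
      usd_per_gpu_hour  rental price of one accelerator per hour
    """
    gpu_seconds = total_flop / (peak_flop_per_s * utilization)
    gpu_hours = gpu_seconds / 3600.0
    return gpu_hours * usd_per_gpu_hour


# Hypothetical inputs: 1e26 FLOP, 1e15 FLOP/s peak, 40% utilization, 2 USD per GPU-hour.
cost = training_cost_usd(1e26, 1e15, 0.4, 2.0)
print(f"{cost / 1e6:.0f} million USD")
```

Under these assumed inputs the estimate comes out on the order of one hundred million dollars, which is consistent with "many millions of dollars". Different hardware, utilization, or pricing assumptions would shift the figure considerably.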

Market context

Machine Learning

People are also trading

Most training run compute greater than 2e27 FLOP by EOY 2026?

Will a new lab create a top-performing AI frontier model before 2028?

Will models be able to do the work of an AI researcher/engineer before 2027?

Will a machine learning training run exceed 10^26 FLOP in China before 2029?

Will a machine learning training run exceed 10^26 FLOP in China before 2028?

Will a machine learning training run exceed 10^26 FLOP in China before 2027?

Will a machine learning training run exceed 10^25 FLOP in China before 2027?

Will a machine learning training run exceed 10^27 FLOP in China before 2028?

Will the largest machine learning training run (in FLOP) as of the end of 2035 be in the United States?

End of pre-training era for language models: Will an LM fine-tune for more FLOPs than it is pre-trained for, before 2026

This got resolved 9.5 months early?

@dsj Sorry about that. I think it's now back open.

@TamayBesiroglu Resolves No?

@TamayBesiroglu I appear to have messed up dates, sorry. This shouldn't have resolved yet?

Ah, yes. @mods can you please unresolve this market?

sold Ṁ18 YES

Is there any evidence that in-context learning + RL works well for state-space models? If a frontier lab wishes to train a sota reasoning model

https://huggingface.co/nvidia/mamba2-hybrid-8b-3t-4k

Do hybrid models count?
