At least one of the most powerful neural nets at end of 2026 will be trained using 10^27 FLOPs

1.1kṀ4255

2027

84%

chance

ALL

Resolves YES if at least one of the most powerfull neural nets publicly known to exist by end of 2026 was trained using at least 10^27 FLOPs. This is ~30 exaFLOP/s years. It does not matter if the compute is distributed, as long as one of the largest models used it. A neural net which uses 10^27 FLOPs but is inferior to other models does not count. Low precision floating point such as fp32, fp16, or fp8 is permitted.

Resolves NO if no such model exists by end of 2026.

If we have no good estimates of training compute usage of top models, resolves N/A.

Show less

Machine Learning

Get

1,000

to start trading!

People are also trading

At least one of the most powerful neural nets at end of 2030 will be trained using 10^27 FLOPs

93% chance

Will there be an announcement of a model with a training compute of over 1e30 FLOPs by the end of 2025?

5% chance

Will an AI model use more than 1e28 FLOPS in training before 2026?

8% chance

Will a machine learning training run exceed 10^26 FLOP in China before 2026?

52% chance

Will the largest machine learning training run (in FLOP) as of the end of 2025 be in the United States?

86% chance

Will a machine learning training run exceed 10^27 FLOP in China before 2028?

44% chance

What will be the parameter count (in trillions) of the largest neural network by the end of 2030?

Will a machine learning training run exceed 10^26 FLOP in China before 2029?

82% chance

Will a machine learning training run exceed 10^26 FLOP in China before 2027?

86% chance

Will a machine learning training run exceed 10^26 FLOP in China before 2028?

Sort by:

> We used Multislice Training to run what we believe to be the world’s largest publicly disclosed LLM distributed training job (in terms of the number of chips used for training) on a compute cluster of 50,944 Cloud TPU v5e chips (spanning 199 Cloud TPU v5e pods) that is capable of achieving 10 exa-FLOPs (16-bit), or 20 exa-OPs (8-bit), of total peak performance.
- https://cloud.google.com/blog/products/compute/the-worlds-largest-distributed-llm-training-job-on-tpu-v5e

Note that for resolution purposes, I will permit 8-bit precision if they can productively utilize it in training.

If I'm doing my math right, it would take ~18 months for this to satisfy the requirements of this market. (Assuming they could get full peak utilization out of it.)