By 2028 will we be able to identify distinct submodules/algorithms within LLMs? | Manifold

21

1kṀ1620

2028

76% chance

Roughly: will we be able to examine an LLM and extract some identifiable submodule that accomplishes an understandable task (e.g. "addition", "inference on some decision tree", or "quicksort")? For instance, it could be some set of neurons from layers L_1, ..., L_k that, when run on its own, executes the specified algorithm.
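
To make "a set of neurons that, when run on its own, executes the algorithm" concrete, here is a minimal sketch on a hand-built toy network, not a real LLM. Everything in it is an assumption made for illustration: the weights `W1` and `W2`, the choice of hidden units, and the hypothetical helper `extract_submodule`. In a real model the hard part is finding which units to pick; here that is simply given.

```python
import numpy as np

# Hypothetical two-layer ReLU network with 4 hidden units (illustrative only).
# Inputs are x = [a, b]; output 0 computes a + b, output 1 computes a - b.
W1 = np.array([
    [ 1.0,  1.0],   # unit 0: relu(a + b)
    [-1.0, -1.0],   # unit 1: relu(-(a + b))
    [ 1.0, -1.0],   # unit 2: relu(a - b)
    [-1.0,  1.0],   # unit 3: relu(b - a)
])
W2 = np.array([
    [1.0, -1.0, 0.0,  0.0],   # output 0 reads units 0 and 1
    [0.0,  0.0, 1.0, -1.0],   # output 1 reads units 2 and 3
])


def forward(x):
    """Run the full toy network."""
    hidden = np.maximum(W1 @ x, 0.0)
    return W2 @ hidden


def extract_submodule(units, output):
    """Return a standalone function built only from the chosen hidden units.

    `units` is a list of hidden-unit indices and `output` is the index of the
    output coordinate the submodule is claimed to drive.
    """
    w1_sub = W1[units]            # rows of W1 for the chosen units
    w2_sub = W2[output, units]    # matching read-out weights

    def submodule(x):
        hidden = np.maximum(w1_sub @ x, 0.0)
        return float(w2_sub @ hidden)

    return submodule


# Claim under test: units {0, 1} on their own implement "addition".
adder = extract_submodule([0, 1], output=0)
```

Calling `adder(np.array([2.0, 3.0]))` runs only the two selected units, and it agrees with coordinate 0 of `forward` on the same input, which is the sense of "run on its own" used above.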

It must also be demonstrated that the LLM actually uses the submodule in some interpretable way. For example, if the module implements quicksort, a demonstration might be that modifying the module to implement reversed quicksort causes the LLM to produce reverse-sorted data when asked for sorted data.
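
The shape of that demonstration can be sketched as a small intervention test. This is only a toy under stated assumptions: `ToyModel`, its `modules` table, and `causal_check` are hypothetical names, and a real LLM has no such clean lookup table, so the sketch shows the logic of the check rather than how to perform it on actual weights.

```python
from typing import Callable, Dict, List

Module = Callable[[List[int]], List[int]]


class ToyModel:
    """Stand-in for an LLM whose behavior routes through named submodules."""

    def __init__(self) -> None:
        self.modules: Dict[str, Module] = {
            "sorter": lambda xs: sorted(xs),
            "dedupe": lambda xs: list(dict.fromkeys(xs)),
        }

    def answer(self, request: str, data: List[int]) -> List[int]:
        if request == "sort":
            return self.modules["sorter"](data)
        if request == "unique":
            return self.modules["dedupe"](data)
        raise ValueError(f"unknown request: {request}")


def causal_check(model: ToyModel, data: List[int]) -> dict:
    """Swap the sorter for a reversed sorter and record what changes."""
    baseline = model.answer("sort", data)
    control_before = model.answer("unique", data)

    original = model.modules["sorter"]
    model.modules["sorter"] = lambda xs: sorted(xs, reverse=True)
    try:
        patched = model.answer("sort", data)
        control_after = model.answer("unique", data)
    finally:
        model.modules["sorter"] = original  # always undo the intervention

    return {
        "baseline": baseline,
        "patched": patched,
        # The edit should flip sorting behavior exactly as predicted...
        "matches_prediction": patched == sorted(data, reverse=True),
        # ...while leaving behavior that does not use the module untouched.
        "control_unchanged": control_before == control_after,
    }


result = causal_check(ToyModel(), [3, 1, 2, 3])
```

The control request matters: if editing the module also changed unrelated behavior, the edit would say less about what the module specifically does.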

The work must be done for an LLM at least as capable as OPT-66B.

The work must identify at least 10 submodules, or identify at least one while proving that no others exist.

If it turns out that the question is ill-posed in a way that can't be fixed with some minor tweaks, I'll resolve N/A.

Up until 2026 I may refine the criteria here, either in response to feedback from predictors or future research giving me a better way to ask the question.

Technical AI Timelines

Technical AI Safety

Mechanistic interpretability

People are also trading

Will LLMs be able to formally verify non-trivial programs by the end of 2025?

Will researchers extract a novel program from the weights of an LLM into a Procedural/OO programming language by 2026?

Will the best LLM in 2027 have <1 trillion parameters?

By 2025 end, will it be generally agreed upon that LLM produced text/code > human text/code for training LLMs?

Will the best LLM in 2026 have <1 trillion parameters?

Will the best LLM in 2025 have <1 trillion parameters?

By 2028, given a one-page description of a task X, AI can download and fine-tune existing open source LLMs for X?

Will the best LLM in 2025 have <500 billion parameters?

Will the best public LLM at the end of 2025 solve more than 5 of the first 10 Project Euler problems published in 2026?

By 2029 end, will it be generally agreed upon that LLM produced text/code > human text/code for training LLMs?
