Will a publicly-available LLM achieve gold on IMO before 2026? | Manifold

Will a publicly-available LLM achieve gold on IMO before 2026?

46

1kṀ9985

2026

10%

chance

4

1H

6H

1D

1W

1M

ALL

Test set: Performance on a pre-existing IMO is acceptable only if the developers claim it was not in the training set, or compelling third party evidence of this is found. Queries must be posed as to IMO participants i.e. with natural language and image input.

Model details: Publicly available means query-able by the public and/or API access. The model need not be open-weights. No internet search allowed. Arbitrary scaffolding, search, program use etc. allowed. Multi-modal systems count as LLMs. If there's a modular system which is part-LM, part-prover if the prover uses parameters which are also back-propagated through during natural language pre-training then it counts as an LM. If there's significant uncertainty about whether this is true e.g. if there's mixed reporting on how modular a closed-source model is, then I will wait to resolve for up to a year. Wall-clock runtime (or effective serial run-time if parallel calls are used) must be less than IMO time i.e. 9 hours for 6 questions.

I will take into account feedback on resolution criteria until September 2024, after which I will try to keep changes to resolution criteria minimal.

IMO Grand Challenge

Get

1,000

to start trading!

People are also trading

Will an AI get gold on any International Math Olympiad by the end of 2025?

-4% 1d25% chance

Will an AI win a gold medal on International Math Olympiad (IMO) 2025?

What will be the IMO 2025 gold cutoff?

AI IMO 2025: How many AI labs announce a Gold performance at the IMO in 2025?

Will an AI get gold on any International Math Olympiad before 2027?

Will China have the best LLM by the end of 2025?

Conditional on AI getting at least bronze on the IMO, will it get gold by 2025?

Will an LLM improve its own ability along some important metric well beyond the best trained LLMs before 2026?

Will Apple release its own LLM on par with state of the art LLMs before 2026?

Will the best public LLM at the end of 2025 solve more than 5 of the first 10 Project Euler problems published in 2026?

Related questions

Will an AI get gold on any International Math Olympiad by the end of 2025?

Will an AI win a gold medal on International Math Olympiad (IMO) 2025?

What will be the IMO 2025 gold cutoff?

AI IMO 2025: How many AI labs announce a Gold performance at the IMO in 2025?

Will an AI get gold on any International Math Olympiad before 2027?

Will China have the best LLM by the end of 2025?

Conditional on AI getting at least bronze on the IMO, will it get gold by 2025?

Will an LLM improve its own ability along some important metric well beyond the best trained LLMs before 2026?

Will Apple release its own LLM on par with state of the art LLMs before 2026?

Will the best public LLM at the end of 2025 solve more than 5 of the first 10 Project Euler problems published in 2026?

© Manifold Markets, Inc.•Terms•Privacy