Benchmark Gap #4: Once a single AI model solves >= 95% of miniF2F, MATH, and MMLU STEM, how many months will it be before an AI is listed as a (co) first author on a published math paper? | Manifold

9

410Ṁ599

2050

37

expected

This question is meant to measure the gap between solving the main math-based benchmarks at the time of market creation and contributing to real-world mathematics.

The co-first-author requirement is loose: I will also accept an AI being credited with significant contributions to both deciding what to prove and the actual proof. Merely contributing to the proof is not enough: I am trying to get at "the AI does the work of a mathematician", not "the AI does the work of a proof assistant". I would also accept, for instance, the human author of the paper stating that they would have named the AI as a coauthor if it were human, or saying that the result could not have been obtained without the assistance of the AI.

Technical AI Timelines

In a lot of pure math, author order is arbitrary/alphabetical. Removing that, I second that it'll be 0. Maybe negative.

I think it is plausible that it will be <0

People already list ChatGPT as a coauthor on scientific papers, but not yet in math.

People are also trading

Does AI Pareto-dominate technical but non-mathematician humans at math?

Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?

Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?

Will AI contribute as much as a co-author would today to a real research mathematics paper before Jan 1 2026?

Benchmark Gap #5: Once a single AI model solves >= 95% of miniF2F, MATH, and MMLU STEM, will it be less than two years before AI models are used as entry-level data science / data analysis / statistics workers?

Will an AI score over 80% on FrontierMath Benchmark in 2025

Benchmark Gap #8: Once a single AI gets >= 80% on FrontierMath Tier 4, how long until an AI publishes a math paper?

Will an AI model write the proof to the Riemann Hypothesis by the end of 2025?

will a paper released in 2025 by a frontier AI lab have one of their AIs as a co-author?

Will an AI co-author a mathematics research paper published in a reputable journal before the end of 2026?
