What will be true of the SOTA AI on the FrontierMath benchmark, before 2026? | Manifold

What will be true of the SOTA AI on the FrontierMath benchmark, before 2026?

17

1.2kṀ2466

Jan 1

90%

Transformer-based architecture

68%

Over 1T parameters

66%

Developed by OpenAI

45%

Part of the GPT-N family of models (GPT-5, GPT-6, and variations)

25%

It is #1 in Elo according to Chatbot Arena Leaderboard at any time

18%

Developed by Google Deepmind

15%

Developed by a non-British and non-American company

13%

Part of the o1 family of models (o1, o2, etc. and variations)

10%

Narrow domain of knowledge. ie Does not know random facts such as when Google was founded, or who won the 1960 presidential election.

9%

Part of the AlphaProof family of models (AlphaProof N and variations)

7%

Based on Symbolic AI (https://en.wikipedia.org/wiki/Symbolic_artificial_intelligence)

5%

Energy-based Model (https://en.wikipedia.org/wiki/Energy-based_model)

An option resolves YES if it is true about the AI model, or program, known to be State of the Art in terms of the FrontierMath benchmark, at the end of the year 2025. It resolves NO otherwise.

You're welcome to add any interesting facts that might or might not be true about the state of the art in math problems, as defined by achieving the highest score on the FrontierMath benchmarks.

I reserve the right to cancel any option that is too vague, too improbable, etc.

See also:
/Bayesian/what-will-true-of-the-sota-ai-on-th-y0LE5uE9n9 (This market)
/Bayesian/what-will-true-of-the-sota-ai-on-th-ROldIhZZgt
/Bayesian/what-will-true-of-the-sota-ai-on-th-RQptyR5uO8

/Bayesian/will-an-ai-achieve-85-performance-o-hyPtIE98qZ
/MatthewBarnett/will-an-ai-achieve-85-performance-o

/Bayesian/will-an-ai-achieve-30-performance-o

Technical AI Timelines

Get

1,000

to start trading!

People are also trading

What will be the best performance on FrontierMath by December 31st 2025?

Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2028?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2027?

Will an AI score over 80% on FrontierMath Benchmark in 2025

Will any AI model achieve > 40% on Frontier Math before 2026?

Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?

Top FrontierMath score in 2025?

Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?

Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?

Related questions

What will be the best performance on FrontierMath by December 31st 2025?

Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2028?

What will be true of the SOTA AI on the FrontierMath benchmark, before 2027?

Will an AI score over 80% on FrontierMath Benchmark in 2025

Will any AI model achieve > 40% on Frontier Math before 2026?

Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?

Top FrontierMath score in 2025?

Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?

Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?

© Manifold Markets, Inc.•Terms•Privacy