Will OpenAI's next-gen math-focused model score at least 95% on the MATH benchmark?

25

206

Ṁ967Ṁ1k

2030

71%

chance

1D

1W

1M

ALL

Resolve to YES if OpenAI's next generation math-focused model achieves a score of 95% or higher on the MATH benchmark.

If the next generation of general models (e.g. GPT-4), code models (e.g. Codex), or any other models specialized for reasoning are released earlier than the math models and score 95% or higher, it will resolve this question to YES.

Benchmarking on a subset of MATH is acceptable.

Using tools(e.g. calculator) & code is allowed.

Get Ṁ600 play money

## Related questions

Will an AI get gold on any International Math Olympiad by 2025?

22% chance

Will AIs be widely recognized as having developed a new, innovative, foundational mathematical theory before 2030?

17% chance

Will an AI win a Gold Medal on the International Math Olympiad by 2029?

64% chance

Will an AI win a Gold Medal on the International Math Olympiad by 2032?

73% chance

Will an AI be capable of achieving a perfect score on the Putnam exam before 2030?

39% chance

Will an AI model outperform 95% of Manifold users on accuracy before 2026?

67% chance

Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025?

73% chance

Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?

66% chance

Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?

39% chance

Will openAI have the most accurate LLM across most benchmarks by EOY 2024?

39% chance