Will OpenAI's next-gen math-focused model score at least 95% on the MATH benchmark?

Basic

26

Ṁ1.0k2030

73%

chance

1D

1W

1M

ALL

Resolve to YES if OpenAI's next generation math-focused model achieves a score of 95% or higher on the MATH benchmark.

If the next generation of general models (e.g. GPT-4), code models (e.g. Codex), or any other models specialized for reasoning are released earlier than the math models and score 95% or higher, it will resolve this question to YES.

Benchmarking on a subset of MATH is acceptable.

Using tools(e.g. calculator) & code is allowed.

Get Ṁ1,000 play money

## Related questions

## Related questions

Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?

66% chance

Will OpenAI's next foundation model score at least 75% in MMMU?

55% chance

Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025?

73% chance

Will any model get above human level (92%) on the Simple Bench benchmark before September 1st, 2025.

41% chance

Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?

55% chance

Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?

56% chance

Will openAI have the most accurate LLM across most benchmarks by EOY 2024?

39% chance

Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?

17% chance

Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?

25% chance

Which MATH-AI 23 works will have >50 Google Scholar citations by end of 2026?