Will OpenAI's next-gen math-focused model score at least 95% on the MATH benchmark?
24
200
Ṁ937Ṁ470
2030
69%
chance
1D
1W
1M
ALL
Resolve to YES if OpenAI's next generation math-focused model achieves a score of 95% or higher on the MATH benchmark.
If the next generation of general models (e.g. GPT-4), code models (e.g. Codex), or any other models specialized for reasoning are released earlier than the math models and score 95% or higher, it will resolve this question to YES.
Benchmarking on a subset of MATH is acceptable.
Using tools(e.g. calculator) & code is allowed.
Get Ṁ200 play money
Related questions
Will an AI get gold on any International Math Olympiad by 2025?
22% chance
Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?
39% chance
Will Gemini achieve a score above 90% on the MATH benchmark?
46% chance
Will an AI model outperform 95% of Manifold users on accuracy before 2026?
67% chance
Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025?
73% chance
Will openAI have the most accurate LLM across most benchmarks by EOY 2024?
39% chance
Will an AI win the $5 million AI Math Olympiad Prize before August 2024?
7% chance
Will an AI get silver on any International Math Olympiad by 2025?
36% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
54% chance
Will AIs be widely recognized as having developed a new, innovative, foundational mathematical theory before 2035?
44% chance