Will OpenAI's next-gen math-focused model score at least 95% on the MATH benchmark?
Basic
26
Ṁ1.0k2030
73%
chance
1D
1W
1M
ALL
Resolve to YES if OpenAI's next generation math-focused model achieves a score of 95% or higher on the MATH benchmark.
If the next generation of general models (e.g. GPT-4), code models (e.g. Codex), or any other models specialized for reasoning are released earlier than the math models and score 95% or higher, it will resolve this question to YES.
Benchmarking on a subset of MATH is acceptable.
Using tools(e.g. calculator) & code is allowed.
Get Ṁ1,000 play money
Related questions
Related questions
Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
66% chance
Will OpenAI's next foundation model score at least 75% in MMMU?
55% chance
Will OpenAI Release a Model Capable of Reliably performing Gradeschool Math from Reasoning by Jan 1, 2025?
73% chance
Will any model get above human level (92%) on the Simple Bench benchmark before September 1st, 2025.
41% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
55% chance
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
56% chance
Will openAI have the most accurate LLM across most benchmarks by EOY 2024?
39% chance
Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?
17% chance
Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?
25% chance
Which MATH-AI 23 works will have >50 Google Scholar citations by end of 2026?