Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
Ṁ803 · resolved Sep 16
Resolved YES
Resolves YES if OpenAI's next-generation language model scores 65% or higher on the GPQA benchmark (extended set).
If an existing OpenAI model reaches 65% or higher through post-training enhancements, that also counts.
Prompt engineering may improve the score after release, but since I don't know how long to wait, I will resolve this question as soon as OpenAI releases its next model.
This question is managed and resolved by Manifold.
Related questions
By the end of Q1 2025 will an open source model beat OpenAI’s o1 model?
76% chance
Will there be an AI language model that strongly surpasses ChatGPT and other OpenAI models before the end of 2024?
9% chance
By the end of Q2 2025 will an open source model beat OpenAI’s o1 model?
76% chance
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
66% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
81% chance
Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?
58% chance
Will there be a model that has a 75% win rate against the latest iteration of GPT-4 as of January 1st, 2025?
62% chance
Will OpenAI launch a significantly better model for ChatGPT paying users in 2024? (>= 100 points diff on ChatBot Arena)
21% chance
By the end of Q3 2025 will an open source model beat OpenAI’s o1 model?
77% chance
Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?
75% chance