What will be true of OpenAI's Orion model?
30
Ṁ2309
Nov 30
85%: It will score better on SWE-Bench Verified than Claude 3.5 Sonnet (October version)
82%: The preparedness scorecard for the model will not be above Medium risk for any category
69%: It will score better on GPQA than o1-preview (73% pass@1)
62%: It will be released before Claude 3.5 Opus or Claude 4
57%: It will be able to output audio without calling another model
45%: It will be called GPT-5
38%: It will be released via ChatGPT before the full o1 model is released via ChatGPT (not o1-preview)
38%: It will have a context window of >= 1 million tokens
36%: It will be able to take video as input
36%: It will be able to output images without calling another model
36%: Once it is available to the public, a Manifold poll asking if it is better or worse than expected will find that it is better than expected

bought Ṁ20 It will be able to t... YES

What happens if it is not released by Jan 1?

bought Ṁ40 It will score better... NO

@JoshYou I will extend the close date of this market until the release is announced.

If the release is delayed and it's unclear whether a released model is Orion, I'll wait for high-quality reporting on whether the new model is Orion. If that reporting never comes, every answer resolves N/A.
