What will be true of Llama 4? [Add answers]
Aug 16
83%: Some version of it will score better on GPQA than o1-preview (73% pass@1)
71%: It will be able to output audio
66%: Some version of it will score better on SWE-Bench Verified than Claude 3.5 Sonnet (October version)
24%: It will be able to output images
16%: Gets the high score on Humanity's Last Exam
14%: Manifold will think any variant is the best model released up to that point in a poll
8%: Gets the high score on FrontierMath
7%: Reaches Highest Arena Score on Chatbot Arena
Resolved YES: It will have a variant with over 400 billion parameters (does not need to release immediately)
Resolved YES: It will have a variant with over 1 trillion parameters (does not need to release immediately)

If an answer doesn't specify which model, any model in the Llama 4 family that meets the criteria causes it to resolve YES.

High-score answers are judged as of the release date: if Llama 4 gets a high score on April 1 and GPT-9000 comes out on April 2 and beats that score, the answer still resolves YES.

  • Update 2025-04-05 (PST) (AI summary of creator comment): Important Update:

    • Llama 4 Behemoth has not yet been released.

    • Answers that rely solely on Llama 4 Behemoth will remain unresolved until its release, unless smaller models in the Llama 4 family satisfy the criteria.

  • Update 2025-05-30 (PST) (AI summary of creator comment): The market is being reopened because Llama 4 Behemoth is not expected to be released soon. This action is consistent with the previous update that answers relying solely on Llama 4 Behemoth will remain unresolved until its release.
