Will an open-source LLM under 10B parameters surpass Claude 3.5 Haiku by EOY 2025?
82% chance

Resolves YES if a language model is released before January 1st, 2026 that:

1. Has freely accessible weights, meaning the general public can download them and run the model locally, regardless of any additional restrictions.
2. Is explicitly described as having fewer than 10 billion parameters. (If the actual parameter count is under 10B but the stated size was rounded up to 10B or more, this does not count as being under 10B.)
3. Achieves an Arena Score on http://lmarena.ai/leaderboard greater than the score of Claude 3.5 Haiku 2024-10-22, with both scores measured at the same point in time.

In the event that the way Arena Scores are calculated changes significantly, or that specific Haiku model is no longer ranked before EOY 2025 (I would be very surprised if this happens), I will try to find a suitable replacement criterion that traders agree is fair. If no such criterion can be found, this market will resolve N/A.
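As a rough illustration of how the three criteria combine, here is a minimal Python sketch of the resolution check. All model names and scores below are hypothetical placeholders, not real leaderboard data, and no official lmarena API is assumed; in practice both Arena Scores would simply be read off the leaderboard at the same point in time.

```python
from dataclasses import dataclass


@dataclass
class Model:
    name: str
    open_weights: bool              # criterion 1: weights are publicly downloadable
    stated_params_billions: float   # criterion 2: parameter count as explicitly described
    arena_score: float              # criterion 3: Arena Score at the snapshot time


def resolves_yes(candidate: Model, haiku_score: float) -> bool:
    """Check the three resolution criteria against one leaderboard snapshot."""
    return (
        candidate.open_weights
        and candidate.stated_params_billions < 10.0
        and candidate.arena_score > haiku_score
    )


# Hypothetical snapshot; both scores are placeholder values taken together.
haiku_2024_10_22_score = 1237.0
candidate = Model(
    name="some-9b-open-model",
    open_weights=True,
    stated_params_billions=9.0,
    arena_score=1250.0,
)

print(resolves_yes(candidate, haiku_2024_10_22_score))  # True under these made-up numbers
```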

Bayesian bought Ṁ1,000 YES

I might be missing something. How could this not happen?

it's mid, it's from months ago, there are over 10 months left in the year, haiku is probably under 100B and under 30B active params, and it's not a reasoning model

@Bayesian hmm yeah this puzzles me, do you think lmsys is not accurately assessing quality here?

@MingCat lmsys is not assessing quality very well, at least. and llms at constant size are getting much better over time

@Bayesian Yeah, I'm hopeful too. Looks like Reka Core is 67B parameters. So the question is basically just how quickly we'll see small open-source models catch up. We've seen AI development show some pretty weird progress overhangs where the tech theoretically should exist, but no one's properly capitalized on it. (It took a while for DeepSeek to come along, for instance.)
