How many will Gemini 3.0 achieve? [Read description]
15
1kṀ4601
2026
1.65
expected
3%
0
54%
1
30%
2
5%
3
5%
4
3%
5

Resolves to the number of these the best Gemini 3 model achieves at launch. If one of them is not known by the time /Balasar/gemini-3-exceeds-expectations resolves, I will count it as not having happened.

  • Update 2025-12-20 (PST) (AI summary of creator comment): When determining which Gemini 3 model's scores to count, "best" refers to the overall model, not benchmark-by-benchmark scores. If a non-"best" model (like Flash) achieves higher scores on individual benchmarks, those scores will likely not be counted unless that model achieves more criteria overall by resolution time.

Get
Ṁ1,000
to start trading!
Sort by:
bought Ṁ260 NO

@robert Flash released a month later. It's not considered the best model, but apparently, it did score above 75% on SWE. So, how would this item resolve?

@adonisds The use of "best" refers to model, rather than benchmark by benchmark score. If flash achieves more by the time the "exceeds expectations" market resolves, I guess I would count all of flash's results. But that seems unlikely.

Tldr: probably doesn't count

bought Ṁ150 NO

HLE and SWE didn't happen

Artificial Intelligence Index did
The other 2 are pending resolution

is HLE with tools or without?

@Fynn I will resolve based on https://scale.com/leaderboard/humanitys_last_exam

Not sure if that is with or without tools

© Manifold Markets, Inc.TermsPrivacy