Will Google's Gemini Ultra model turn out to be "mehr Schein als sein"?
Mar 31
80% chance

"mehr Schein als sein" = "more appearance than substance"

I saw Google's demo of Gemini (tweets linked below) and it looked amazing but overly rehearsed. I get that demos are used to sell a product. OpenAI's DALL-E 3 demo was better than reality, but it was still within limits. I wonder what I will think of Google's Gemini model after getting a chance to use it.

I will resolve this market after using Gemini for at least two weeks. I will be kind to Google and will only resolve this market as "Yes" if my experience doesn't match the stuff we saw over the past couple of days at all.

There's a market at 90% which says it will beat ChatGPT 4 on benchmarks, but this is at 75% for "crap"/hype.

bought Ṁ20 of YES

@StephanHeijl I tried Bard since I got an email saying it was now Gemini Pro. Not good. It seems to have minimal awareness of the context window so far, and it doesn't read what you say to it objectively; it interprets it in favour of its prior assertions. It might do well on some types of benchmarks but then do badly as a conversational chatbot when the subject is complicated in any way. Hopefully it's because it's new.

bought Ṁ30 of NO

Ultra or pro?

@CalvinLoveland I will use the best version I can get my hands on. I am also willing to pay up to 20-25 EUR (same as ChatGPT).

@Soli the demos are presumably Ultra, but Ultra isn't available yet. So the best version you can get now is Pro, but that's not really the relevant comparison.

@chrisjbillington When do you think I will be able to use Ultra? If it's a couple of weeks, then I think we can push the deadline of the market. But if it's a couple of months, then I think it is fair to use the worse model, since in some ways it is misleading to demo something, say it will be available soon, and then just release something else for months.

@Soli Don't know, but markets here suggest unlikely this month, probably February or March, and unlikely to be later than that. That sounds reasonable to me.

I think this market becomes a completely different one if you test Pro instead of Ultra, as Pro is not even in the same weight class as Ultra. You might as well resolve YES right away without actually testing in that case.

So despite it being poor form to demo something and not release it for a bit, I don't think this question is particularly meaningful unless it's about Ultra.

@chrisjbillington OK, I am fine with waiting for Ultra if participants of this market do not see a problem with this ✌🏼 Maybe we can ping the top 5 YES holders?

@chrisjbillington I changed the title to include Ultra. Let's see if anyone complains in the next 48 hours; otherwise we consider this change done.

The demo makes it seem like its modalities include continuous video input. After Dev Day I saw someone build a similar feature via the APIs, but Gemini (Ultra?) has this from day 1? If so, I think that will impress you. Even if it's not as impressive in actual use as in the demo, I think you'll come down on substance.

bought Ṁ25 NO from 82% to 79%
bought Ṁ90 of YES

@VAPOR the demo was faked, in that they actually just took screenshots from the videos and provided those to Gemini Ultra along with very specific prompts
