Will Apple mention Large Language Models (LLM) at WWDC 2023?
63
665
1.3K
resolved Jun 7
Resolved
NO

LLMs are all the rage, but Apple is notoriously tight lipped about techologies until its released as a project. Will they pretend as if LLM's don't exist?

Resolves YES if LLMs are mentioned in any capacity at the WWDC 2023 keynote (developer sessions or state of the union doesn't count). Otherwise resolves NO.

Get Ṁ200 play money

🏅 Top traders

#NameTotal profit
1Ṁ774
2Ṁ180
3Ṁ97
4Ṁ87
5Ṁ51
Sort by:
predicted NO

I didn't hear any reference to LLMs (the autocomplete section came close but used a different type of NN). If you have a counterexample reply to this comment, otherwise I will be resolving NO

predicted YES

@bjubes I'd like time to watch it, can you wait for tomorrow to resolve?

predicted NO
predicted YES

@bjubes "M2 Ultra can train large transformer models". Unclear to me if this should count or not https://youtube.com/clip/UgkxxdVpoYvFfuLNBXF4BQ4D4BoesZnLlM8z

predicted NO

@bjubes I didn't hear LLMs mentioned or find that phrase in the transcript. Obviously biased though.

predicted YES

@bjubes Oh that's the same clip you were talking about. "Transformer Language Model" is the same architecture used by ChatGPT etc., so the only question is whether you consider it "Large", which is kind of subjective IMO

predicted YES

@YoavTzfati I think it should count. I don't know why it's a "different type of NN" ?

predicted NO

@bjubes depends how literal you want to be with the resolution criteria. I assumed it meant that exact phrase.

predicted NO

@YoavTzfati @WieDan Large isn't just an adjective, its a fundamental distinction. The keyboard transformer language model cannot be large, there isn't close to enough ram or compute on any handheld to even run such a model, and even GPT-esqe models run in datacenters are much slower (compare chatGPT output speed to autocorrect speed).

The "M2 Ultra can train large transformer models" is much closer. However, their choice to not mention large or language in this context is the whole spirit of this market. Apple is shying away from saying the buzzwords, and leaving it vague. I think their omission of saying LLM, even when its what everyone is thinking when they mention ML workloads is purposeful and keeping with their consistent ML branding over the AI hype.

predicted YES

@bjubes When Google presented PaLM 2 they said that the smallest version (Gecco) could be run on a phone. There are increasingly techniques to distill large language models and fine tune them with very little compute (see for example the recent QLora).

If I were Apple and wanted to take advantage of the LLM advances somehow these are the two ways I would do it - put an LLM in the IPhone, and make the macs able to train LLMs.

predicted NO

@YoavTzfati and while the Mac Pro can train models, I’m still leaning towards LLM not being “Mentioned” since they explicitly omitted the term in a natural place to bring it up. They never mentioned language at all in the mac pro section despite it being the obvious use case everyone is familiar with

predicted YES

@bjubes I mean "large transformer models" is pretty obviously referring to language models IMO.

But It's worth mentioning that I won't be offended if this resolves NO, I don't think there's a decisive case in either direction. And we should really have had this discussion in advance and not now

predicted YES

@YoavTzfati Oh another option is to get someone that didn't bet on the market to decide

predicted YES

@bjubes It's also an option to resolve n/a

predicted NO

@YoavTzfati I asked for a volunteer in Discord but no one offered. I am going to resolve NO as LLM's were not mentioned directly. While the nod to them was obvious, so was the nod to the metaverse, which was also avoided in similar Apple fashion. That was meant to be the spirit of the market, how Apple talks about tech hype in their marketing events.

But yeah in the future I'll not bet on Markets like this which can be subjective, I had thought this one would be pretty clear cut.

bought Ṁ10 of YES
predicted NO

Extended the close time to just before the keynote starts

predicted YES

Can you please edit the question to explicitly call out WWDC’s keynote? Considering only the keynote is a very different question to WWDC in general.

I still think it’s >25% for the keynote but not by much - they’ll probably announce some ML dev tooling that hypes up Apple Silicon, but they’d might call out LLMs explicitly only in a dev session

More related questions