Will OpenAI release GPT-4 finetuning by Fall 2023?
138 · 1.7K · Dec 22 · 37% chance

Resolves YES if GPT-4 is publicly available to be finetuned by 2023-12-21 19:19:00 PST. NO otherwise.

Context (tweet embed): "with fine-tuning for GPT-4 coming this fall"

James Babcock bought Ṁ50 of YES

Arb trading with https://manifold.markets/getby/will-gpt4-finetuning-be-available-b because I predict that whenever they release this, it will either be before Dec 21 or after Dec 31.

firstuserhere bought Ṁ100 of NO

Unlikely if the entire team quits

Aaron Breckenridge predicts NO

From the announcement today:

GPT-4 fine tuning experimental access

We’re creating an experimental access program for GPT-4 fine-tuning. Preliminary results indicate that GPT-4 fine-tuning requires more work to achieve meaningful improvements over the base model compared to the substantial gains realized with GPT-3.5 fine-tuning. As quality and safety for GPT-4 fine-tuning improves, developers actively using GPT-3.5 fine-tuning will be presented with an option to apply to the GPT-4 program within their fine-tuning console.
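
For concreteness, GPT-3.5 fine-tuning already goes through OpenAI's public fine-tuning API, so a "publicly available" GPT-4 version would presumably be the same call with a different model name. A minimal sketch with the OpenAI Python SDK (the `model="gpt-4"` value and the `train.jsonl` file are assumptions for illustration, not anything OpenAI has announced):

```python
# Hypothetical sketch: GPT-4 fine-tuning through the existing fine-tuning
# endpoint. Today this flow works with "gpt-3.5-turbo"; "gpt-4" is speculative.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload a JSONL file of chat-formatted training examples (assumed to exist)
training_file = client.files.create(
    file=open("train.jsonl", "rb"),
    purpose="fine-tune",
)

# Create the fine-tuning job; the model name is the speculative part
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-4",
)
print(job.id, job.status)
```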

Aaron Breckenridge predicts NO

I’d say this is still not “publicly available” since it requires applying for a private program.

Mira predicts YES

@AaronBreckenridge If I get access to GPT-4 fine-tuning (I'll apply if given the option), or if the public can use fine-tuned GPT-4 models even if they can't train them, those will definitely count for this market.

A EULA is fine, but if it's strictly B2B (the models can only be used within a non-personal organization) or you need to sign NDAs, that won't count. It has to be available to "non-affiliates" of OpenAI.

If it's partially released to a small group of the public, but getting in is a lucky find like a lootbox in a mobile game, that sounds like NO, or I'd resolve it 50%. I'll count it YES if at least the signup form is public and there are reports of at least some non-affiliates getting access.

Aaron Breckenridge predicts NO

This description says nothing about merely using fine-tuned models.

"Releasing fine-tuning" would necessitate training, per the tweet linked in the description: "fine-tuning lets you train…"

You also say it must be "publicly available to fine-tune".

MrLuke255

@Mira It looks like you have a conflict of interest

nanob0nus predicts YES

@chrisjbillington Just felt like this is probably NO and no one dares to contradict.

ShadowyZephyr

Not only would fine-tuning it be INCREDIBLY slow, GPT-4 also has a LOT more capabilities. By the time they released GPT-3.5 fine-tuning, open-source models had almost caught up to GPT-3.5. Allowing people to fine-tune GPT-4 would have implications that OpenAI themselves probably don't even fully understand yet. The applications people could find would be both great and terrifying.

Jon Simon bought Ṁ0 of NO

You guys understand that this model is >10x the size of ChatGPT, right? Their own in-house engineers can probably barely fine-tune this thing.

nanob0nus bought Ṁ10 of YES

@jonsimon Well, the procedures can be the same, just slower.

Jon Simon predicts NO

@gigab0nus Computing gradients is quadratic in the number of parameters, so we're talking 100x slower. That's a lot slower.

nanob0nus predicts YES

@jonsimon You often don't fine-tune the whole network. But yes, it will be much slower. If they say so, they will surely find a way to make the fine-tuning experience acceptable.

Tom Potter

@jonsimon Why couldn't they use something like LoRA to massively save compute (and, just as importantly, memory when loading multiple versions of the model for inference)?

Jon Simon predicts NO

@TomPotter They probably could, but in practice it's more complicated to add a LoRA adapter, especially for a mixture-of-experts-style model like GPT-4.

Not saying it's not doable, but I am saying you shouldn't expect it to be as straightforward as it is for GPT-3.5.
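
For anyone unfamiliar with LoRA: the idea is to freeze the base weights and train only a small low-rank update, which is where the compute and memory savings come from. A toy PyTorch sketch, with made-up layer sizes and rank purely for illustration (this says nothing about how OpenAI would actually implement it):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update: W x + scale * B A x."""
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False                     # freeze original weights
        self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.lora_a.T @ self.lora_b.T)

# Compare trainable parameter counts on a single toy layer
base = nn.Linear(4096, 4096)
lora = LoRALinear(base, rank=8)
full = sum(p.numel() for p in base.parameters())
trainable = sum(p.numel() for p in lora.parameters() if p.requires_grad)
print(f"full fine-tune: {full:,} params; LoRA trainable: {trainable:,} params")
```

The adapter trains tens of thousands of parameters per layer instead of tens of millions, which is the "massively save compute" part; the point above is that wiring adapters into a large production mixture-of-experts stack is the harder engineering problem.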

kcs

@jonsimon Do you have a source for the quadratic cost of computing gradients? Doesn't backprop have approximately the same time complexity as the forward step?
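
For what it's worth, this is easy to sanity-check: backpropagation costs roughly a small constant multiple of the forward pass, so forward+backward time grows with model size at about the same rate as the forward pass alone rather than quadratically. A rough timing sketch in PyTorch (layer sizes, depth, and step counts are arbitrary, and absolute numbers will vary by machine):

```python
import time
import torch

def time_passes(d_model: int, n_layers: int = 8, batch: int = 32, steps: int = 20):
    """Time forward-only vs forward+backward for a stack of linear layers."""
    model = torch.nn.Sequential(
        *[torch.nn.Linear(d_model, d_model) for _ in range(n_layers)]
    )
    x = torch.randn(batch, d_model)

    t0 = time.perf_counter()
    with torch.no_grad():                       # forward only
        for _ in range(steps):
            model(x)
    fwd = time.perf_counter() - t0

    t0 = time.perf_counter()
    for _ in range(steps):                      # forward + backward
        model(x).sum().backward()
        model.zero_grad(set_to_none=True)
    fwd_bwd = time.perf_counter() - t0

    n_params = sum(p.numel() for p in model.parameters())
    print(f"d_model={d_model:5d}  params={n_params:>12,}  "
          f"fwd={fwd:.3f}s  fwd+bwd={fwd_bwd:.3f}s  ratio={fwd_bwd / fwd:.2f}")

for d in (256, 512, 1024):
    time_passes(d)
```

If computing gradients were quadratic in the parameter count, the fwd+bwd/fwd ratio would grow with model size; in practice it stays a roughly constant factor.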