Did OpenAI use muP for zero-shot hyperparameter transfer in GPT-4?

110Ṁ169

Dec 31

81%

chance


Maximal Update Parameterization (muP) is a technique published last year by Yang et al. at Microsoft: https://arxiv.org/abs/2203.03466

— LLM & AI Capabilities—


3 Comments

4 Holders

8 Trades


predictedYES

@firstuserhere interesting that it is in the bibliography, although the reference in the first image is from a different section of the report with its own bibliography (that [16] actually refers to "DALL·E 2 Preview - Risks and Limitations.").

So the muP paper is in the bibliography, but not referenced anywhere.

@Stefan yep, and even then it isn't actually used in GPT-4; the report only mentions that the red team used the paper?
