Did OpenAI use MUP for zero shot hyper-parameter transfer in GPT-4?
81% chance
Maximal Update Parametrization (muP) is a technique published in 2022 by Yang et al. at Microsoft: https://arxiv.org/abs/2203.03466
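For context, the core idea of muP is that if you scale initialization and per-layer learning rates with model width in the right way, hyperparameters tuned on a small "proxy" model transfer zero-shot to a much wider one. A minimal sketch of those width-scaling rules (my own illustrative function, not OpenAI's or the paper's code; the exact rules in the paper are per-parameter-type and optimizer-dependent):

```python
def mup_scaled_hparams(base_lr, base_std, base_width, width):
    """Illustrative muP-style scaling for hidden (matrix-like) weights
    trained with Adam: learning rate shrinks like 1/width, init std
    like 1/sqrt(width), and output logits are multiplied by 1/width.
    Hyperparameters are assumed tuned at base_width."""
    ratio = width / base_width
    return {
        "hidden_lr": base_lr / ratio,              # LR ~ 1/width
        "hidden_init_std": base_std / ratio**0.5,  # init std ~ 1/sqrt(width)
        "output_mult": 1.0 / ratio,                # output multiplier ~ 1/width
    }

# Example: hyperparameters tuned on a width-256 proxy, reused at width 4096.
hp = mup_scaled_hparams(base_lr=1e-3, base_std=0.02, base_width=256, width=4096)
print(hp)
```

The point of the question is whether GPT-4 used this scheme to tune its hyperparameters on small proxy models, as the paper demonstrated for GPT-3-scale transfer.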
This question is managed and resolved by Manifold.
@firstuserhere interesting that it is in the bibliography, although the reference in the first image is from a different section of the report with its own bibliography (that [16] actually refers to "DALL·E 2 Preview - Risks and Limitations.").
So the muP paper is in the bibliography, but not referenced anywhere.
@Stefan yep, and even then it's not actually used in GPT-4; the report only mentions that the red team used the paper.
Related questions
Will OpenAI release the weights of GPT-3? (2024)
3% chance
Will OpenAI release GPT-4.5 before GPT-5?
58% chance
Will OpenAI change their naming scheme (GPT-X) with the successor to GPT-4? (Ṁ200 subsidy!)
14% chance
Has openAI intentionally made chatGPT lazy to save inference costs?
21% chance
Will the next LLM released by OpenAI be worse than GPT-4 at MMLU?
16% chance
Did OpenAI intentionally handicap GPT4's image modality's ability to identify people?
83% chance
Will OpenAI's autonomous agent be based on GPT-4?
19% chance
Will OpenAI suggest GPT-4 is AGI?
4% chance
Will OpenAI provide access to GPT-4 weights to academic researchers not affiliated with OpenAI, by 2025?
14% chance
Will OpenAI open source the weights to one of the GPT family models in 2024?
3% chance