What will be the maximum achievable flop utilization on the next generation of Nvidia server chips?
Basic
33
แน€1.1k
2025
1.1%
<30%
1.5%
30-40%
7%
40-50%
10%
50-60%
27%
60-70%
24%
70-80%
9%
80-90%
20%
90-100%

Concretely, what is the best FLOPS (Floating Point Operations per Second) the next generation of Nvidia server cards will be able to achieve on fp16 matrix multiplications on matrices generated by the normal distribution, divided by the maximum theoretical FLOPS that Nvidia reports?

For example, for A100s, it's possible to achieve about 280+ TeraFLOPS out of a maximum of 312 TeraFLOPS, for a maximum flop utilization of ~90%.

On H100s, it seems to be closer around 700 TeraFLOPS, out of a maximum of 1000.

Will resolve when values seem clear after HNext cards are released, or a maximum of one year after Nvidia announces it.

To clarify, this will be for the B100, not the B200.

Get แน€600 play money