Discussion about this post

Tectonyx

Cooling is likely the real bottleneck. Air → liquid → immersion → maybe cryogenic. As GPT-likes scale and inference workloads run longer, the cooling load only compounds.

Paul Mah

It seems like Nvidia plans to stick with direct-to-chip cooling. I just spoke to a data centre that is building Nvidia's cloud - the real challenge is the data centre itself, which must be rebuilt (again) to support 600 kW racks.
