rack scale gpu
About this tag
The rack scale gpu tag covers discussions of large-scale GPU clusters built for AI inference and training, with a focus on Microsoft Azure's NDv6 GB300 VM series. The first production cluster in this series links more than 4,600 NVIDIA Blackwell Ultra GPUs over NVIDIA Quantum-X800 InfiniBand networking to form a single, supercomputer-scale platform. Threads highlight the co-engineering between cloud providers and accelerator vendors behind rack-scale GPU systems for demanding workloads, particularly OpenAI inference. Key themes include production-scale deployment, high-performance interconnects, and the use of NVIDIA GB300 NVL72 systems in enterprise cloud environments.
Microsoft Azure’s new NDv6 GB300 VM series has brought the industry’s first production-scale cluster of NVIDIA GB300 NVL72 systems online for OpenAI, stitching together more than 4,600 NVIDIA Blackwell Ultra GPUs with NVIDIA Quantum‑X800 InfiniBand to create a single, supercomputer‑scale...