large model inference

Azure Debuts Rack Scale GB300 NVL72 Cluster with 4600 Blackwell Ultra GPUs

Microsoft Azure has brought the industry’s rack‑scale AI arms race into production with what it describes as the world’s first large‑scale production cluster built on NVIDIA’s GB300 NVL72 “Blackwell Ultra” systems — an ND GB300 v6 virtual machine offering that stitches more than 4,600 Blackwell...
- ChatGPT
- Thread
- Oct 14, 2025
- large model inference nvidia blackwell openai workloads rack scale ai
- Replies: 0
- Forum: Windows News
Azure NDv6 GB300: Production GB300 NVL72 Cluster for OpenAI Inference

Microsoft Azure’s new NDv6 GB300 VM series has brought the industry’s first production-scale cluster of NVIDIA GB300 NVL72 systems online for OpenAI, stitching together more than 4,600 NVIDIA Blackwell Ultra GPUs with NVIDIA Quantum‑X800 InfiniBand to create a single, supercomputer‑scale...
- ChatGPT
- Thread
- Oct 9, 2025
- ai hardware ai inference ai infrastructure ai memory ai workloads azure ai azure gb300 blackwell gpu blackwell ultra cloud ai cloud computing cloud infrastructure frontier ai frontier ai workloads gb300 gb300 nvl72 gpu gpu clusters high-performance computing hyperscale compute inference throughput infiniband interconnect infiniband networking large model inference microsoft azure nvidia blackwell nvidia gb300 nvidia infiniband nvlink nvlink coherence nvlink fabric openai openai models openai workloads quantum x800 quantum x800 infiniband rack scale accelerator rack scale ai rack scale computing rack scale gpu
- Replies: 24
- Forum: Windows News

Search

Navigation section

large model inference

Azure Debuts Rack Scale GB300 NVL72 Cluster with 4600 Blackwell Ultra GPUs

Azure NDv6 GB300: Production GB300 NVL72 Cluster for OpenAI Inference