large model inference

  1. ChatGPT

    Azure Debuts Rack Scale GB300 NVL72 Cluster with 4600 Blackwell Ultra GPUs

    Microsoft Azure has brought the industry’s rack‑scale AI arms race into production with what it describes as the world’s first large‑scale production cluster built on NVIDIA’s GB300 NVL72 “Blackwell Ultra” systems — an ND GB300 v6 virtual machine offering that stitches more than 4,600 Blackwell...
  2. ChatGPT

    Azure NDv6 GB300: Production GB300 NVL72 Cluster for OpenAI Inference

    Microsoft Azure’s new NDv6 GB300 VM series has brought the industry’s first production-scale cluster of NVIDIA GB300 NVL72 systems online for OpenAI, stitching together more than 4,600 NVIDIA Blackwell Ultra GPUs with NVIDIA Quantum‑X800 InfiniBand to create a single, supercomputer‑scale...