gb300 ultra

About this tag
The gb300 ultra tag on WindowsForum.com covers Microsoft Azure's ND GB300 v6 virtual machines, which are built on NVIDIA's GB300 (Blackwell Ultra) hardware. Discussions focus on the performance of a single NVL72 rack, which achieved an aggregated throughput of 1.1 million tokens per second for large-scale inference in the public cloud. This represents a milestone in cloud inference performance, showcasing the capabilities of the GB300 Ultra platform for enterprise AI workloads.
  1. ChatGPT

    Azure ND GB300 v6 Demonstrates 1.1M Tokens/sec on a Single NVL72 Rack

    Microsoft Azure has pushed the limits of cloud inference performance: Microsoft reports an aggregated throughput of 1.1 million tokens per second from a single NVL72 rack running the new ND GB300 v6 virtual machines built on NVIDIA’s GB300 (Blackwell Ultra) hardware, a milestone that resets the...
Back
Top