Microsoft Azure Unveils HBv5 VMs with Custom AMD CPUs for High-Performance Computing

In a move likely to send ripples through the high-performance computing (HPC) landscape, Microsoft has unveiled its latest Azure HBv5 virtual machines, powered by a custom AMD CPU. The new chip appears to be a revival of the long-rumored MI300C design, which until now existed only in enthusiast speculation.

Unpacking the CPU Specifications

So, what precisely are we looking at with this new custom AMD EPYC CPU? AMD has built a chip for Microsoft that exposes 88 Zen 4 cores to the VM. Four of these chips go into each HBv5 VM, and together they provide up to 450 GB of HBM3 memory and close to 7 TB/s of memory bandwidth, far more than their predecessors offered.
Earlier HPC-oriented CPUs such as Milan-X and Genoa-X raised effective bandwidth with AMD's 3D V-Cache technology. With the HBv5 series, Microsoft goes a step further, having recognized that memory bandwidth is a critical bottleneck in HPC workloads, a detail that is easy to overlook when comparing core counts and clock speeds.
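
A quick back-of-the-envelope calculation shows what the headline figures mean per chip and per core. This is only a sketch: it assumes decimal units (1 TB = 1000 GB) and an even split of the quoted 7 TB/s across four chips with 88 cores each, and the function names are illustrative.

```python
# Back-of-the-envelope split of the quoted HBv5 bandwidth figure.
# Assumptions: decimal units (1 TB = 1000 GB) and an even split
# across chips and cores; real per-core bandwidth will vary.
TOTAL_BANDWIDTH_TB_S = 7.0   # quoted for four chips
CHIPS_PER_VM = 4
CORES_PER_CHIP = 88          # cores quoted per chip


def per_chip_bandwidth_tb_s(total_tb_s: float, chips: int) -> float:
    """Bandwidth share of one chip, in TB/s."""
    return total_tb_s / chips


def per_core_bandwidth_gb_s(total_tb_s: float, chips: int, cores_per_chip: int) -> float:
    """Bandwidth share of one core, in GB/s."""
    return total_tb_s * 1000 / (chips * cores_per_chip)


if __name__ == "__main__":
    chip = per_chip_bandwidth_tb_s(TOTAL_BANDWIDTH_TB_S, CHIPS_PER_VM)
    core = per_core_bandwidth_gb_s(TOTAL_BANDWIDTH_TB_S, CHIPS_PER_VM, CORES_PER_CHIP)
    print(f"per chip: {chip:.2f} TB/s, per core: {core:.1f} GB/s")
```

Under these assumptions each chip contributes about 1.75 TB/s, or roughly 20 GB/s for every core.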

What's the Big Deal About HBM3?

HBM3 (High Bandwidth Memory) is a stacked DRAM technology typically reserved for data center-grade GPUs. Its main advantage is far higher bandwidth than conventional DDR memory. In the context of Azure's HBv5 VMs, HBM3 serves a dual purpose: it supplies the memory bandwidth that HPC codes need, and it lets the CPUs run more efficiently because memory traffic is served from HBM rather than from conventional RAM.

Performance Implications

Let’s talk performance. Bolstering these Azure VMs with HBM3 gives them a remarkable bandwidth increase of nearly nine times over the previous Genoa-X chips and an almost 20-fold improvement over the Milan-X counterparts. The implications for users who rely on Azure for compute-intensive tasks—such as machine learning and large-scale simulations—are monumental.
The HBv5 VMs are equipped with 352 Zen 4 cores (though Microsoft has wisely disabled SMT for this configuration), ensuring that even the most demanding workloads can run with efficiency and speed. Not only will organizations be able to perform computations faster, but Azure's offerings will likely become more appealing as businesses scramble to leverage cloud computing for their computational needs.
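
The quoted speedups can be turned around to see what bandwidth they imply for the older parts. The sketch below simply divides the 7 TB/s figure by each ratio. Because the post says "nearly" nine times and "almost" 20-fold, the results are rough lower bounds rather than published specifications, and the names are illustrative.

```python
# Implied baseline bandwidth, derived only from the ratios quoted in the post.
# These are rough estimates, not vendor specifications.
HBV5_BANDWIDTH_TB_S = 7.0
QUOTED_SPEEDUPS = {"Genoa-X": 9.0, "Milan-X": 20.0}


def implied_baseline_tb_s(new_bandwidth_tb_s: float, speedup: float) -> float:
    """Baseline bandwidth implied by a quoted speedup factor."""
    return new_bandwidth_tb_s / speedup


if __name__ == "__main__":
    for name, factor in QUOTED_SPEEDUPS.items():
        baseline = implied_baseline_tb_s(HBV5_BANDWIDTH_TB_S, factor)
        print(f"{name}: roughly {baseline:.2f} TB/s or slightly more")
```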

A Custom CPU That May Not Be So Custom

Intriguingly, this so-called custom CPU isn't as bespoke as one might think. It closely resembles AMD's MI300C, a chip that was designed but never officially released. The MI300C was conceived as a CPU-only variant of the MI300 accelerator family, populated with Zen 4 cores instead of CDNA GPU compute units. This blurring of lines between traditional computing roles aligns with current trends in processing technology, where workloads often demand hybrid capabilities.
Phil Park, an AMD memory engineer, shed light on the challenges faced by high-bandwidth memory designs. According to him, committing to HBM3 forced AMD to make specific architectural decisions, and HBM does not suit all CPU applications, given its bandwidth requirements. This helps explain why HBM-equipped CPUs have been slow to reach the market: they represent a considerable design and manufacturing investment rather than a flexible mainstream offering.

Azure's Exclusive Offering: A Strategic Move

What makes this CPU particularly noteworthy is its exclusivity to Microsoft Azure. Glenn Lockwood from Microsoft confirmed that the chip will not be offered to other vendors or sold as a standard EPYC part. This exclusivity indicates a significant strategic play by Microsoft, possibly a means to tilt the competitive balance in cloud services toward Azure. By collaborating closely with AMD on this design, Microsoft is not merely aiming to keep pace with competitors like AWS and Google Cloud; it is looking to set the standard for what high-performance computing should look like.

Conclusion: Bridging the Future of HPC in the Cloud

This latest iteration of Microsoft's Azure offerings solidifies its place in the expanding cloud landscape of HPC. By harnessing the might of AMD's HBM3-enabled custom EPYC CPUs, users can expect phenomenal advancements in computational capabilities that were previously confined to academic and research institutions.
As we anticipate further developments, it will be exciting to see how businesses will adapt to and implement the new capabilities afforded by Azure's HBv5 virtual machines. The era of ultra-efficient, memory-bountiful cloud computing is upon us, and the future looks incredibly bright for Windows users willing to tap into Azure's newest offerings. So, as you contemplate your next computing needs, remember: the frontier of high-performance computing is evolving, and it’s happening right in the cloud.

Source: Tom's Hardware AMD crafts custom EPYC CPU with HBM3 memory for Microsoft Azure – CPU with 96 Zen 4 cores and 450GB of HBM3 may be repurposed MI300C, four chips hit 7 TB/s [Updated]
 

On November 19, 2024, amidst the keynotes and breakout sessions at Ignite 2024, Microsoft revealed a groundbreaking addition to its high-performance computing (HPC) lineup: the Azure HBv5 Virtual Machines. Designed specifically for demanding computational workloads, these machines are engineered to tackle the growing memory bandwidth bottlenecks that many HPC users have been facing.

The HBv5: A New Paradigm in HPC

Microsoft has teamed up with AMD to harness the power of custom-built EPYC 9V64H processors exclusively available on Azure, creating a VM that significantly amplifies memory bandwidth while ensuring efficiency and configurability. With Azure HBv5, organizations can expect to push their computational limits like never before, tackling complex simulations in fields such as financial modeling, weather forecasting, molecular dynamics, and computational fluid dynamics (CFD).

Key Features of Azure HBv5

  • Memory Bandwidth Breakthrough: Azure HBv5 delivers nearly 7 TB/s of memory bandwidth. To put this into perspective, that is up to 8 times higher than other leading cloud and bare-metal options, and nearly 20 times more than Azure's earlier HBv3 generation.
  • Parameter Configurability: The new VM allows users to configure memory allocation up to 9 GB per core, with a total of 400-450 GB of RAM using cutting-edge HBM3 technology.
  • Powerful CPU Offering: Users can benefit from up to 352 AMD EPYC ‘Zen4’ CPU cores, peaking at 4 GHz, tailored to meet extensive computational requirements.
  • Infinity Fabric: The architecture includes a total Infinity Fabric bandwidth that is double that of any previous AMD EPYC design, optimizing CPU communication.
  • Networking Innovations: Azure HBv5 comes packed with 800 Gb/s NVIDIA Quantum-2 InfiniBand, ensuring robust data movement between compute nodes.
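
The memory figures in the list above can be related to each other with simple division. The sketch below shows memory per core when all 352 cores are active, and how many cores could be active if each were given the full 9 GB. It assumes, as an interpretation not confirmed in the post, that the 9 GB per core limit is reached by exposing fewer cores to the VM; the function names are illustrative.

```python
# Relating total HBM3 capacity, core count, and memory per core.
# ASSUMPTION: the "up to 9 GB per core" figure is reached by running
# with fewer active cores; the post does not state this explicitly.
MAX_CORES = 352
MEMORY_RANGE_GB = (400, 450)
MAX_GB_PER_CORE = 9


def memory_per_core_gb(total_gb: float, active_cores: int) -> float:
    """Memory available to each core when capacity is split evenly."""
    return total_gb / active_cores


def max_cores_for(total_gb: float, gb_per_core: float) -> int:
    """Largest whole number of cores that can each receive gb_per_core."""
    return int(total_gb // gb_per_core)


if __name__ == "__main__":
    for total in MEMORY_RANGE_GB:
        full = memory_per_core_gb(total, MAX_CORES)
        few = max_cores_for(total, MAX_GB_PER_CORE)
        print(f"{total} GB: {full:.2f} GB/core at {MAX_CORES} cores, "
              f"{few} cores at {MAX_GB_PER_CORE} GB/core")
```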

Who Can Benefit?

The Azure HBv5 is positioned as a game-changer for research organizations and industries engaged in high-demand calculations and simulations. It addresses the critical needs of:
  • Financial Services: For real-time risk and portfolio management simulations, where split-second decisions are paramount.
  • Life Sciences: Assisting in molecular dynamics, pivotal for drug discovery and biochemistry experiments.
  • Weather and Climate Modeling: Enabling rapid simulations that can predict weather patterns with higher accuracy.

The Bigger Picture: A Step Forward for Cloud Computing

With the introduction of the HBv5, Microsoft isn't just offering a new VM but is redefining the capabilities of cloud-based HPC solutions. The skyrocketing demand for computational resources due to the exponential growth of data, especially in scientific and commercial fields, means that organizations cannot afford to compromise on performance.

The Technological Underpinnings

The backbone of this impressive performance surge comes from:
  • High Bandwidth Memory (HBM): Unlike conventional memory, HBM is stacked memory that offers higher bandwidth while occupying less physical area, making it ideal for complex computations.
  • Custom-designed AMD EPYC CPUs: These processors have been specifically tailored for tasks requiring vast amounts of data to be processed quickly, answering HPC users' call for speed and efficacy.
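
One common way to reason about why memory bandwidth matters is the roofline model, in which attainable throughput is the lesser of peak compute and arithmetic intensity multiplied by memory bandwidth. The sketch below applies it to the roughly 7 TB/s figure from the announcement. The peak compute value and the arithmetic intensities are hypothetical placeholders chosen for illustration, not HBv5 specifications.

```python
# Roofline-model sketch: attainable throughput = min(peak, intensity * bandwidth).
# HYPOTHETICAL_PEAK_GFLOPS is a placeholder, not a published HBv5 number.
HYPOTHETICAL_PEAK_GFLOPS = 10_000.0
HBV5_BANDWIDTH_GB_S = 7_000.0   # ~7 TB/s, as quoted in the post


def attainable_gflops(peak_gflops: float, bandwidth_gb_s: float,
                      flops_per_byte: float) -> float:
    """Roofline bound for a kernel with the given arithmetic intensity."""
    return min(peak_gflops, flops_per_byte * bandwidth_gb_s)


def ridge_point(peak_gflops: float, bandwidth_gb_s: float) -> float:
    """Arithmetic intensity (FLOP/byte) above which a kernel is compute-bound."""
    return peak_gflops / bandwidth_gb_s


if __name__ == "__main__":
    for intensity in (0.1, 1.0, 10.0):  # illustrative FLOP/byte values
        bound = attainable_gflops(HYPOTHETICAL_PEAK_GFLOPS,
                                  HBV5_BANDWIDTH_GB_S, intensity)
        print(f"{intensity:>5.1f} FLOP/byte -> at most {bound:,.0f} GFLOP/s")
```

Kernels with low arithmetic intensity sit on the bandwidth-limited part of the roofline, which is why raising memory bandwidth helps them directly while extra cores alone would not.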

Looking Ahead: Previews and Pre-Bookings

Excited potential users can sign up for the Azure HBv5 Preview, expected to launch in the first half of 2025. This will pave the way for organizations to experience firsthand the capabilities of this next-generation computing infrastructure.

In Conclusion

As we stride deeper into the era of digital transformation, innovations like the Azure HBv5 VM position Microsoft Azure at the forefront of HPC in the cloud. By focusing on memory bandwidth and configurability, Microsoft is enabling enterprises to harness the full potential of cloud computing to tackle increasingly complex scientific and business challenges.
While every technological leap carries risks and uncertainties, the landscape is rich with opportunities for those ready to embrace cutting-edge capabilities. Are you prepared to navigate this new HPC frontier? Let the conversation begin on WindowsForum.com!

Source: HPCwire Announcing Azure HBv5 Virtual Machines: A Breakthrough in Memory Bandwidth for HPC - HPCwire
 

At the recent Supercomputing Conference 2024 (SC24), the spotlight was firmly on the latest innovation from the Microsoft Azure HPC team: the HBv5 instance, powered by AMD’s MI300C chip. This new instance type is raising eyebrows across the technical community, and for good reason. Featuring an architectural shift that combines cutting-edge technologies, it promises to elevate high-performance computing (HPC) to unprecedented levels. Let’s dive deep into what makes the Azure HBv5 such an exciting development for Windows users and the broader tech ecosystem.

What Is the HBv5 Instance?

The HBv5 instance represents Microsoft's latest foray into specialized hardware for HPC applications. It uses the AMD MI300C, a derivative of AMD's MI300 accelerator design that pairs Zen 4 CPU cores with High Bandwidth Memory (HBM). The design aims to deliver significantly better performance than the Intel Xeon Max 9480, the HBM-equipped Intel CPU it is compared against.

Understanding the MI300C Architecture

What exactly is the MI300C? At its core, this chip reworks the original MI300A architecture to integrate up to 96 Zen 4 cores per chip, though Azure has chosen to reserve eight of these on each chip for virtualization overhead. SMT (Simultaneous Multithreading) is disabled to deliver optimum performance in computational tasks, a common practice in HPC setups.
This 4-socket design enriches the HBv5 instance with massive capabilities, allowing it to support complex computational tasks that demand superior processing power and memory bandwidth.

Key Features of the HBv5 Instance

  • High Core Count: The HBv5 gives customers access to up to 352 of the 384 physical cores, fully optimized for intensive processing tasks.
  • Abundant Memory: Users can access 400-450GB of HBM3 memory, ensuring that applications can run smoothly without memory bottlenecks.
  • Enhanced Bandwidth: The combination of the Infinity Fabric with double the bandwidth of previous generations means data can flow seamlessly between components, a critical demand for HPC workloads.
  • Cutting-Edge Connectivity: Equipped with four 200 Gb/s NVIDIA Quantum-2 NDR InfiniBand links (800 Gb/s in total), the HBv5 instance promises low-latency communication that is essential for data-intensive applications.
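
The core and networking figures quoted in this thread can be cross-checked with a little arithmetic. The sketch below uses only numbers from the posts (four sockets, 96 cores per chip, eight reserved per chip, 200 Gb/s per link, 800 Gb/s total); the function names are illustrative.

```python
# Cross-checking the core counts and InfiniBand figures quoted in the thread.
SOCKETS = 4
CORES_PER_CHIP = 96
RESERVED_PER_CHIP = 8       # held back for virtualization overhead
LINK_SPEED_GBIT_S = 200     # per InfiniBand link
TOTAL_NETWORK_GBIT_S = 800  # aggregate figure quoted for the VM


def physical_cores(sockets: int, cores_per_chip: int) -> int:
    """Total cores present across all sockets."""
    return sockets * cores_per_chip


def usable_cores(sockets: int, cores_per_chip: int, reserved_per_chip: int) -> int:
    """Cores left for customers after the per-chip reservation."""
    return sockets * (cores_per_chip - reserved_per_chip)


def implied_link_count(total_gbit_s: int, per_link_gbit_s: int) -> int:
    """Number of links needed to reach the aggregate figure."""
    return total_gbit_s // per_link_gbit_s


if __name__ == "__main__":
    print(physical_cores(SOCKETS, CORES_PER_CHIP), "physical cores")
    print(usable_cores(SOCKETS, CORES_PER_CHIP, RESERVED_PER_CHIP), "usable cores")
    print(implied_link_count(TOTAL_NETWORK_GBIT_S, LINK_SPEED_GBIT_S), "links")
```

The 384 and 352 core counts and the 800 Gb/s aggregate all follow from the per-chip figures, which suggests one 200 Gb/s link per socket.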

Implications for HPC and Windows Users

The implications of the Azure HBv5 instance are profound, particularly for organizations that rely on intensive computational models, from scientific research to financial modelling and beyond. Its bespoke design can lead to:
  • Increased Efficiency: With such processing power concentrated into virtualized environments, organizations can execute models and simulations that were previously unachievable.
  • Scalability: The ability to add more VMs or scale existing workloads without compromising speed allows businesses to grow without investing heavily in hardware.
  • Cost-Effectiveness: The move to high-density HPC instances can reduce the overall costs associated with managing sprawling server farms.
This mesh of performance enhancements and infrastructure efficiency will likely attract numerous enterprises hoping to leverage advanced computing capabilities.

The Bigger Picture: AMD's Chiplet Architecture

Understanding the MI300C’s design demands a grasp of AMD’s chiplet architecture strategy, which allows for flexibility and scalability across various applications. By optimizing designs for specific clientele – like Microsoft – AMD underscores its focus on tailoring solutions that not only meet but exceed current market needs. This specialization reflects a broader industry trend where hardware manufacturers move towards creating bespoke solutions for major players rather than mass-market parts.
The innovations represented by the HBv5 and MI300C could well foreshadow a future where customized hardware takes precedence in computing, especially in sectors relying heavily on digital transformation.

Conclusion: Why Does the HBv5 Matter?

The unveiling of the Microsoft Azure HBv5 instance with the AMD MI300C chip at SC24 is more than just an exciting announcement — it signifies a paradigm shift in high-performance computing. By combining raw processing power with enhanced memory capabilities and reliability, the HBv5 caters squarely to the needs of modern enterprises seeking transformative computing solutions.
As we move deeper into an era dominated by data-centric decision-making, the promise of solutions like Azure HBv5 exemplifies how cutting-edge technology can empower businesses to reach their full potential.
So, whether you are a tech enthusiast, a Windows user, or a business decision-maker, keeping an eye on advancements like the Azure HBv5 will be more than beneficial—it's essential in staying at the forefront of technology in this rapidly evolving landscape. Exciting times lie ahead for Windows and Azure users alike!

Source: ServeTheHome This is the Microsoft Azure HBv5 and AMD MI300C
 
