KB5096136: Windows 11 26H1 Servicing Update for AMD Vitis AI Execution Provider

Microsoft has published KB5096136, an automatic Windows Update package for Windows 11 version 26H1 that updates the AMD Vitis AI Execution Provider to version 2.2605.2.0 for systems using Windows machine learning and ONNX Runtime acceleration. That sounds like the sort of servicing footnote most users will never see unless they dig through Update history. It is more important than its size suggests, because it shows how Windows is turning local AI acceleration into a serviced platform layer rather than a driver-side curiosity. The update is also another reminder that Microsoft’s AI PC strategy now depends on a stack of small, vendor-specific components arriving on a cadence administrators will have to understand, inventory, and trust.

Diagram shows ONNX Runtime inference on AMD NPU with local image enhancement and text translation, alongside Windows Update.Microsoft Is Servicing the AI PC One Execution Provider at a Time​

KB5096136 is not a feature update, not a cumulative update, and not the kind of package that promises a new button on the taskbar. It is a component update for the AMD Vitis AI Execution Provider, the layer that lets ONNX Runtime and Windows machine learning workloads hand supported inference jobs to AMD hardware instead of simply falling back to the CPU.
That distinction matters. The modern Windows AI stack is not a single monolithic subsystem that magically detects every neural processor and uses it optimally. It is a chain of runtimes, execution providers, model formats, drivers, silicon-specific libraries, app frameworks, and policy decisions. When one link changes, the user-facing effect may be subtle, but the platform contract changes with it.
Microsoft’s support article is terse by design. It says the update contains improvements for the AMD Vitis AI Execution Provider AI component on Windows 11, version 26H1, requires the latest cumulative update for that Windows version, installs automatically through Windows Update, and replaces KB5089175. The visible sign of success is a line in Settings under Windows Update history: “Windows Runtime ML AMD NPU Execution Provider Update (KB5096136).”
That phrasing is revealing. Microsoft does not present this as an AMD software bundle, a developer preview, or a separate accelerator package. It is a Windows Runtime ML update. The branding tells administrators where Microsoft wants responsibility to sit: inside Windows servicing, even when the implementation is necessarily silicon-specific.

The Execution Provider Is Where AI Hype Meets Hardware Reality​

The term execution provider is easy to ignore because it sounds like plumbing, and in a sense it is. In ONNX Runtime, an execution provider is the adapter that determines where supported parts of a machine-learning model actually run. A model may be expressed in a portable format, but real acceleration depends on mapping operators, memory behavior, precision formats, and scheduling choices to the hardware underneath.
For AMD systems, Vitis AI is the relevant AMD stack for accelerating inference on supported AMD platforms. Microsoft’s own description points at Ryzen AI, AMD Adaptable SoCs, and Alveo data-center accelerator cards as part of the broader target family. In the Windows client context, the most interesting target is Ryzen AI hardware in PCs, where Microsoft and AMD are trying to make NPUs useful to normal applications rather than merely impressive on a spec sheet.
That is the unglamorous but essential layer beneath every local-AI promise. If a Windows app wants to run a model locally for image processing, language assistance, content extraction, camera effects, or search, it should not have to ship a separate acceleration stack for every NPU vendor. The operating system is supposed to provide a common route into the hardware. Execution providers are one of the places where that abstraction becomes real.
They are also where the abstraction leaks. Not every model runs everywhere. Not every operator is supported by every provider. A workload may split across NPU, GPU, and CPU depending on model structure and available support. Developers may see performance or compatibility differences between AMD, Intel, Qualcomm, and Nvidia paths even when using the same high-level Windows ML APIs.
KB5096136 therefore belongs to a category of updates that will rarely produce dramatic release notes but can materially affect whether local AI feels fast, battery-efficient, and reliable. A minor version bump in this layer can matter more to an AI workload than a much louder shell feature.

Windows 11 26H1 Makes This Update Feel Narrower Than It Really Is​

The update’s Windows 11 version target is notable: 26H1. Microsoft has described Windows 11 version 26H1 as a targeted release rather than the next general annual Windows feature update for the broad installed base. Devices on 26H1 have a different servicing story than mainstream 24H2 and 25H2 PCs, and Microsoft has said those systems will not move to the next second-half annual feature update in the usual way.
That makes KB5096136 look, at first glance, like a niche update for a narrow set of machines. In deployment terms, it is. A Windows 11 24H2 or 25H2 user should not assume this specific KB is intended for their device just because they have an AMD processor. Microsoft also published related AI component packages for other Windows versions and provider combinations, which is part of the point: the servicing matrix is becoming more fragmented.
The bigger story is not that every Windows user needs KB5096136. It is that Microsoft is now comfortable shipping AI substrate updates as ordinary Windows Update payloads, segmented by OS release, hardware capability, and provider. That is a quiet architectural shift for the PC.
For decades, Windows administrators mostly thought of hardware enablement as a mixture of inbox drivers, OEM driver packages, firmware, and occasional feature packs. AI PCs add a new layer: model runtimes and execution providers that may be neither traditional device drivers nor normal applications. They sit in the middle, and they will have to be maintained with the same seriousness as graphics drivers or security-sensitive codecs.

The Automatic Install Is the Policy Statement​

Microsoft says KB5096136 downloads and installs automatically from Windows Update. That may be the most consequential line in the support note.
Automatic installation signals that Microsoft does not want AI execution providers treated as optional developer components. If a device is eligible and current on cumulative updates, the platform layer should be brought forward without a user having to know what Vitis AI is. That makes sense for reliability and consistency. An application developer targeting Windows ML does not want to diagnose a user’s stale execution provider before a model can run properly.
For consumers, the benefit is simplicity. If a Copilot+ PC or AI-capable Windows device gains better local inference behavior, the improvement can arrive as part of routine servicing. The user may never know which provider made the difference, and arguably should not need to.
For enterprises, automatic installation cuts both ways. The less visible a component is, the more important inventory and change control become. An execution provider can affect application behavior, performance characteristics, power consumption, and potentially security posture. That does not mean KB5096136 is suspicious; it means this class of update deserves to be tracked.
The prerequisite reinforces the same model. Microsoft requires the latest cumulative update for Windows 11 version 26H1 before KB5096136 installs. That keeps the AI layer tied to the underlying OS servicing baseline, reducing the odds that a new provider lands on an older platform state that lacks required runtime pieces.

AMD’s Role Is Strategic, Not Merely Supportive​

AMD’s presence in this update is not incidental. The AI PC market is now a three-way contest between silicon vendors trying to make their NPUs relevant and an operating system vendor trying to abstract those differences without erasing them. Qualcomm has pushed Windows on Arm hard through Snapdragon X-class chips. Intel has made NPU acceleration part of its Core Ultra story. AMD has Ryzen AI and a broader Vitis AI ecosystem.
Microsoft needs all of them. A Windows AI platform that works well only on one vendor’s hardware would be commercially awkward and technically fragile. A platform that tries to hide every silicon difference too aggressively risks leaving performance on the table. Execution providers are the compromise: common app-facing frameworks above, vendor-tuned acceleration below.
For AMD, Vitis AI gives the company a path to make its hardware visible to Windows applications without forcing every developer to become an AMD specialist. That is especially important because AI acceleration is not just about peak TOPS numbers. It is about whether useful models can run efficiently, repeatedly, and predictably in the applications users already have open.
This is where version numbers like 2.2605.2.0 become part of the competitive story. They are not marketing slogans, but they are evidence of cadence. If Microsoft and AMD can update the provider regularly, fix compatibility problems, improve performance, and align with Windows ML changes, AMD’s AI hardware becomes more credible as a platform target.

The Update History Line Is Now an Admin Artifact​

Microsoft tells users to verify installation by opening Settings, going to Windows Update, then Update history, and looking for “Windows Runtime ML AMD NPU Execution Provider Update (KB5096136).” That instruction is mundane, but it points to a growing operational problem: the Windows AI stack is becoming something administrators will need to audit.
In the old model, a help desk might ask for the Windows build, GPU driver version, and BIOS revision. In the AI PC model, the relevant state may include Windows version, cumulative update level, NPU driver, execution provider version, app SDK version, model package version, and policy configuration. A failed local AI feature may not be “Copilot is broken.” It may be that the app is targeting a provider capability the device has not yet received.
The industry has been here before. GPU compute, media acceleration, and browser hardware acceleration all became troubleshooting surfaces once applications started depending on them. AI acceleration is likely to follow the same path, but with more vendors, more privacy expectations, and a faster cadence.
The good news is that Windows Update history gives at least one user-visible breadcrumb. The bad news is that Update history is not the same thing as fleet observability. Enterprises will want this information exposed clearly through management tools, reporting pipelines, and support diagnostics. If AI components are going to be serviced like platform dependencies, they need to be visible like platform dependencies.

Local AI Needs Boring Updates More Than Grand Demos​

The PC industry has spent the last two years selling AI hardware through demos: background blur, image generation, semantic search, document recall, text assistance, and live effects. Demos matter, but they are not what makes a platform durable. Durable platforms are built through boring updates that fix edge cases, improve compatibility, and keep developers from abandoning the abstraction.
KB5096136 is one of those boring updates. It does not claim a headline feature. It does not promise a new Copilot experience. It does not publish a long list of benchmark improvements. It simply advances the AMD Vitis AI Execution Provider for eligible Windows 11 26H1 systems.
That restraint should not be mistaken for irrelevance. Local AI is full of hidden dependencies. If a model runs on the CPU instead of the NPU, battery life may suffer. If an execution provider mishandles an operator, an app may fall back silently or fail loudly. If provider versions vary too widely across the installed base, developers will target the lowest common denominator or avoid the stack entirely.
In that context, regular provider updates are not maintenance after the fact. They are a precondition for the platform becoming useful. The AI PC only works as a category if the software stack improves after the hardware ships.

Microsoft’s AI Component Model Is Becoming Its Own Servicing Lane​

Microsoft’s release information for Windows AI components shows a broader pattern: execution providers and other AI components are being tracked with their own release history. That puts components such as AMD Vitis AI, AMD MIGraphX, Intel OpenVINO, Nvidia TensorRT-RTX, and Qualcomm QNN into a recognizable servicing lane.
This is a departure from the mental model most Windows users still have. A Windows update used to be thought of as an operating system patch, a driver update, a Defender definition, or an app update. AI components blur those boundaries. They are OS-adjacent, hardware-specific, developer-facing, and user-impacting all at once.
The naming also creates friction. KB5096136 updates AMD Vitis AI Execution Provider version 2.2605.2.0, while Microsoft’s Windows ML documentation may discuss execution providers in terms of MSIX package versions, runtime tracks, and SDK compatibility. That is manageable for developers, but confusing for everyone else.
Microsoft will need to make this simpler without hiding too much. If an app says a local AI feature requires an updated execution provider, users should not have to decode three versioning schemes. If an administrator blocks or defers a component, they should know what breaks. The servicing lane is emerging; the operational language around it is still catching up.

The Replacement of KB5089175 Shows the Cadence Is Monthly, Not Occasional​

KB5096136 replaces KB5089175, the prior AMD Vitis AI Execution Provider update. That replacement detail is easy to skim past, but it suggests a cadence. These are not one-time enablement packages issued at hardware launch and then forgotten. They are part of an ongoing rhythm.
A monthly or near-monthly rhythm has advantages. AI frameworks and hardware runtimes are moving quickly, and waiting for annual Windows feature updates would be too slow. Provider updates can ship improvements closer to the pace of silicon enablement and developer adoption.
The risk is churn. If execution providers change frequently, developers and IT teams need to know whether updates are additive, breaking, security-relevant, or performance-oriented. Microsoft’s KB text for this update uses the generic phrase “includes improvements,” which is serviceable for consumers but thin for professionals.
That thinness is not unique to Microsoft. Vendors often keep low-level release notes short because the meaningful details are obscure, proprietary, or only relevant to a narrow developer audience. But as AI acceleration becomes a mainstream Windows dependency, “improvements” will not always be enough. Administrators will increasingly ask what changed, why it matters, and whether a given app workload should be retested.

Developers Get a Cleaner Target, but Not a Free Ride​

For Windows developers, the promise of Windows ML and ONNX Runtime acceleration is portability. Build against supported APIs and model formats, and let the platform choose the best available hardware. In theory, that reduces the need to ship separate code paths for every NPU vendor.
In practice, developers still need to test. Execution providers vary in supported operators, performance behavior, precision tradeoffs, and failure modes. An app that performs well through one provider may behave differently through another. A model that accelerates cleanly on one silicon family may split execution across devices elsewhere.
KB5096136 does not eliminate that complexity. It simply updates one provider in one Windows servicing context. But updates like this are the mechanism by which the abstraction gets better. Every provider revision gives Microsoft and AMD a chance to narrow gaps between what developers expect and what the hardware can actually execute.
The best outcome is boring: apps ask Windows for acceleration, Windows routes work to AMD’s NPU where appropriate, and the user gets faster or more efficient local inference without caring about the stack. The worst outcome is fragmentation disguised as abstraction. Microsoft’s servicing discipline will determine which version Windows users experience.

Security and Privacy Are Part of the Same Story​

Local AI is often sold as a privacy improvement because data can stay on the device rather than being sent to a cloud service. That argument is valid as far as it goes, but local processing does not remove the need for trust. It moves trust into the local stack.
An execution provider sits close to sensitive workloads. It may process images, text embeddings, audio features, document content, or application data depending on what the model is doing. Bugs in this layer could cause crashes, incorrect results, information exposure, or privilege-boundary concerns, depending on architecture and integration.
That does not mean KB5096136 is a security update. Microsoft’s article does not present it as one. But the category deserves security-minded attention because accelerated local AI is becoming part of the trusted computing base for user experiences that handle personal data.
This is another reason automatic Windows Update delivery is both helpful and consequential. Keeping these components current reduces the risk of long-lived stale runtimes. At the same time, organizations with strict validation processes will want a way to understand and stage these updates without losing control of the AI features their users depend on.

The 26H1 Split Foreshadows a Messier Windows Future​

Windows 11 version 26H1 is not just another label in this story. It is a sign that Windows servicing is becoming more conditional on hardware generations and platform needs. Microsoft’s historical pitch for Windows has been broad compatibility: one Windows line serving a sprawling ecosystem. AI PCs pressure that model.
New NPUs, new Arm cores, new power-management requirements, and new local model workloads all create reasons to tune Windows for specific hardware classes. That may be technically sensible. It may also make the Windows version landscape harder to explain.
KB5096136 lives inside that tension. It is a small provider update, but it targets a Windows release with a special place in the roadmap. Existing mainstream PCs may receive analogous AI component updates under different KB numbers, while 26H1 devices follow their own path. The more Microsoft does this, the more important clear documentation becomes.
The danger is not that Windows has multiple servicing tracks. It already does. The danger is that users and administrators lose the ability to tell which track they are on, which AI capabilities are supported, and which updates are expected. AI features are already hard enough to describe without version-line ambiguity.

The Practical Read for WindowsForum Readers​

For enthusiasts, KB5096136 is a signal to stop treating AI PC support as a single checkbox in a spec sheet. The NPU matters, but so does the execution provider that makes it usable. If you are testing local AI workloads on AMD hardware, provider version should be part of your notes.
For sysadmins, the update is a reminder that Windows Update is now carrying more AI-specific payloads. These packages may not reboot the world or change the Start menu, but they can alter the behavior of local inference workloads. If your organization is piloting AI PCs, you should start collecting this component state before users file tickets that are impossible to reproduce.
For developers, the message is more nuanced. The Windows ML path is becoming more real, but it remains a moving target. That is not necessarily bad. A young platform that updates regularly is healthier than one frozen at launch. But applications that depend on local acceleration need diagnostics, fallbacks, and test coverage across vendors.
And for ordinary users, the answer is simple: if KB5096136 appears in Update history on an eligible Windows 11 26H1 device, it is expected. It is not a general Windows feature upgrade, and it is not something most people need to manually seek out.

The Small KB That Explains the AI PC’s Maintenance Burden​

KB5096136 is a compact update with a larger lesson: the AI PC is not a product you buy once, but a stack Microsoft and its silicon partners must keep servicing. The update’s concrete facts are straightforward, but their implications reach into deployment, development, support, and trust.
  • KB5096136 updates the AMD Vitis AI Execution Provider to version 2.2605.2.0 for Windows 11 version 26H1 systems.
  • The update is delivered automatically through Windows Update and requires the latest cumulative update for Windows 11 version 26H1.
  • The package replaces KB5089175, showing that AMD’s Windows AI execution layer is already on an active servicing cadence.
  • Users can confirm installation in Windows Update history under the Windows Runtime ML AMD NPU Execution Provider entry.
  • The update matters most for workloads that use ONNX Runtime or Windows machine learning to accelerate inference on supported AMD hardware.
  • IT teams testing AI PCs should begin tracking execution provider versions alongside Windows builds, drivers, firmware, and application versions.
The useful way to read KB5096136 is not as a dramatic new feature, but as a maintenance receipt for Microsoft’s larger bet: that Windows can become the broker between AI applications and an increasingly diverse hardware ecosystem. If that bet succeeds, users will experience local AI as something that simply works and improves quietly over time. If it fails, the AI PC will become another compatibility maze of vendor runtimes, version mismatches, and features that work only in demos. For now, the quiet arrival of AMD’s updated execution provider suggests Microsoft understands that the future of on-device AI will be won less by keynote moments than by the invisible discipline of servicing the stack underneath them.

References​

  1. Primary source: Microsoft Support
    Published: Tue, 26 May 2026 21:02:38 Z
  2. Related coverage: windowsforum.com
  3. Official source: learn.microsoft.com
 

Back
Top