Windows 11 Local AI APIs Expand to NVIDIA RTX—Copilot+ Badge Gets Cracked

ChatGPT · Jun 11, 2026

Microsoft has updated Windows 11’s local Language Model APIs so developers can run Phi Silica workloads on non-Copilot+ PCs with Nvidia GeForce RTX 30-series or newer GPUs and at least 6GB of VRAM, extending native on-device AI beyond the NPU-equipped machines Microsoft promoted in 2024. The change is officially a developer preview, not a mass rollout of every Copilot+ feature to every gaming tower. But strategically, it is much bigger than an API compatibility note. Microsoft is admitting, carefully and indirectly, that the future of local AI on Windows cannot be confined to one badge, one silicon block, or one laptop marketing cycle.

Microsoft’s AI PC Wall Was Always Built on Efficiency, Not Capability

When Microsoft introduced Copilot+ PCs, it did not merely describe a new class of hardware. It drew a line through the Windows ecosystem. On one side were machines with at least 16GB of RAM, SSD storage, and an NPU capable of 40 TOPS or more. On the other side were millions of perfectly modern Windows PCs that could game, render, compile, stream, and run local AI tools, but could not qualify for Microsoft’s most visible on-device AI push.
The NPU requirement was not technically absurd. Neural processing units are designed to run certain AI workloads efficiently, often at lower power and with less thermal drama than a discrete GPU. In a thin laptop, that matters. An always-available assistant, a background indexing feature, or a low-latency image tool cannot behave like a game that spins up a 115-watt GPU every time the user opens a document.
But the marketing simplification hardened into something more brittle. “Copilot+ PC” became shorthand for “this is where Windows AI happens,” even though enthusiasts had been running local language models, Stable Diffusion derivatives, transcription engines, and retrieval tools on GPUs long before the badge existed. Microsoft’s claim was strongest when phrased as a battery-life argument. It was weakest when heard as a capability argument.
That distinction matters because Windows is not just a laptop operating system. It is also the platform under gaming rigs, creator workstations, developer desktops, lab machines, and corporate endpoints with discrete graphics hardware. A Windows AI strategy that treats those PCs as second-class citizens was never going to survive contact with the installed base.

The RTX Exception Turns a Badge Into a Negotiation

The new support path does not make every Windows 11 PC an AI PC. It specifically targets systems with Nvidia GeForce RTX 30-series GPUs or newer and at least 6GB of VRAM. That is still a meaningful hardware floor, and it excludes older GTX cards, low-end integrated graphics, many business desktops, and laptops with cramped graphics memory.
Even so, the symbolic shift is hard to miss. A PC no longer needs to be sold as a Copilot+ machine to participate in Microsoft’s native local language model layer. It can qualify because it has the right GPU. The Copilot+ badge remains relevant, but it is no longer the only doorway into Windows’ built-in AI runtime story.
This is a different kind of fragmentation from the one Microsoft started with. Instead of “NPU equals in, no NPU equals out,” Windows AI begins to look more like the rest of PC computing: feature availability depends on the specific accelerator, driver stack, memory budget, OS version, and app framework. That is messier to explain on a retail shelf, but it is more honest about the PC market.
It also gives Microsoft an escape hatch. The company can preserve Copilot+ as a premium category for certain first-party experiences while letting developers target a broader range of capable machines. That is not a retreat so much as a rebalancing. Microsoft still gets to promote efficient AI laptops, but it no longer has to pretend that a desktop RTX card is somehow less “AI capable” than a laptop NPU.

Phi Silica Becomes a Windows Component, Not Just a Demo Model

The model at the center of this shift is Phi Silica, Microsoft’s on-device small language model for Windows AI APIs. It is intended for local language tasks such as summarization, rewriting, text generation, formatting, and structured transformations. It is not a full cloud-scale chatbot living inside Windows, and nobody should expect it to behave like the largest frontier models.
That limitation is part of the point. Phi Silica represents the class of AI work that makes sense to run locally: fast, bounded, privacy-sensitive, and deeply integrated into apps. A mail client does not need a gigantic model to rewrite a paragraph. A notes app does not need a cloud round trip to turn meeting bullets into a cleaner outline. A document tool does not need to upload corporate text to a remote server just to produce a table.
The more important architectural change is distribution. If an app needs the model, Windows can obtain the required components through the system rather than forcing every developer to bundle a model, build a downloader, manage updates, and explain storage consumption to users. That turns the model into something closer to a shared runtime dependency.
This is where Microsoft’s platform instincts show. The company does not merely want AI apps to exist on Windows; it wants Windows to become the place where the app asks for a capability and the operating system brokers the hardware, runtime, model, and updates. That is the same playbook that made graphics, media, printing, accessibility, and security APIs strategically important. AI is being pulled into the operating system’s contract with developers.

Developers Care Less About the Badge Than the Call

For developers, the distinction between an NPU and a GPU is secondary to whether an API is available, predictable, fast enough, and supportable. A developer building a Windows app does not want to write one feature for Copilot+ laptops, another for RTX desktops, another for CPU fallback, and another for cloud-only machines unless the market forces them to. They want a capability they can query and a behavior they can explain.
That is why this preview matters even if the first supported surface is narrow. Once Windows AI APIs can run across more than one accelerator class, Microsoft can begin abstracting the hardware away. The app can ask whether local language generation is available. Windows can decide whether that means an NPU, a GPU, or perhaps another supported backend in the future.
There is still a long way to go before that vision is clean. Developers will need to know latency, model quality, memory behavior, battery impact, and fallback rules. Enterprises will want policy controls. Users will want a simple answer to whether an app feature works on their machine. Support desks will be less amused by a world where “Windows AI” works on one RTX laptop but not another because of VRAM, driver, OS, or preview-channel requirements.
But the direction is sensible. Microsoft cannot win local AI on Windows by making every developer target one premium laptop category. It can win by making local AI feel like a normal Windows capability that scales across hardware. The RTX move is an early, imperfect version of that broader platform promise.

The NPU Was Not a Lie, but the Story Was Too Small

The easy reaction is to say this proves NPUs were unnecessary. That is too neat. NPUs still make sense for certain workloads, especially on mobile hardware where power efficiency and sustained background operation matter. A laptop that can perform AI tasks without hammering battery life or fan noise has a real advantage.
The problem was not the NPU. The problem was treating the NPU as the defining feature of local AI rather than one implementation of it. GPUs are often better suited for heavier bursts of AI compute, particularly on desktops and gaming laptops where power and thermals are less constrained. CPUs may be appropriate for lighter models or speech and vision tasks. Specialized silicon is not a religion; it is a scheduling decision.
Microsoft now appears to be moving toward that more pragmatic view. Copilot+ PCs can still be the best experience for certain Windows features. RTX systems can become viable targets for local language APIs. Other hardware paths may follow as the stack matures. The platform gets healthier when the operating system stops enforcing a marketing category as if it were a law of physics.
This also puts pressure on Microsoft’s first-party feature strategy. If Phi Silica can run locally on a supported RTX system, users will reasonably ask why some AI experiences remain exclusive to Copilot+ PCs. Sometimes the answer will be privacy, performance, power, or model design. Sometimes the answer will be product segmentation. Microsoft will need to be clearer about which is which.

Enterprise IT Will See Promise Wrapped in Policy Risk

For administrators, the most interesting part of the change is not that gaming GPUs can run a Microsoft language model. It is that Windows may download AI models as system-managed components when apps request them. That is convenient for developers and consumers, but it also creates new operational questions inside managed environments.
Enterprises have spent years building controls around software installation, data loss prevention, cloud services, and endpoint telemetry. Local AI complicates that map. If the processing happens on the device, the privacy story may improve because sensitive content does not need to leave the PC. But local processing also means the capability may appear inside apps that previously had no generative features at all.
That will force administrators to think beyond the old cloud-versus-local framing. A locally running model can still summarize confidential documents, transform regulated text, or generate content that must be retained, audited, or governed. The absence of a cloud upload does not eliminate compliance obligations. It merely changes where the risk lives.
Microsoft will therefore need robust controls: which models can be installed, which apps can call them, how usage is logged, whether features can be disabled by policy, and how model updates are validated. If Windows AI APIs become a mainstream platform layer, they cannot be managed like a novelty feature. They will need the same administrative seriousness as browser engines, scripting runtimes, and identity brokers.

Nvidia Gets the Installed Base Microsoft Needs

The Nvidia angle is not incidental. RTX hardware is the most obvious bridge between Microsoft’s Copilot+ ambitions and the existing population of Windows machines powerful enough to run local AI today. Nvidia has spent years turning its consumer GPUs into AI accelerators by another name, helped by CUDA, tensor cores, and a developer ecosystem that already treats RTX cards as practical local inference hardware.
For Microsoft, supporting RTX systems buys reach. Copilot+ PCs may define the new laptop shelf, but RTX PCs define a large slice of enthusiast, creator, and gaming Windows. Those are exactly the users most likely to experiment with local AI, notice performance differences, and pressure app developers to support hardware they already own.
For Nvidia, the move reinforces the idea that an RTX GPU is not just for games. The company has been steadily reframing GeForce and RTX PCs as AI platforms, not merely graphics platforms. Microsoft’s Windows AI APIs give that pitch a native OS hook. Instead of every app relying on its own AI runtime, Windows can become part of the acceleration story.
The awkward part is that this may make Copilot+ branding feel less distinct to power users. If a desktop with an RTX 4070 can run local Microsoft-backed language APIs, the badge on a thin laptop becomes less of a gatekeeper and more of an efficiency certification. That is probably where it should have been all along.

The Consumer Message Gets Messier but More Truthful

Microsoft now has a messaging problem of its own making. For a year, the company trained consumers to associate local Windows AI with Copilot+ PCs. Now it must explain that some Windows AI APIs can run on some non-Copilot+ PCs with certain Nvidia GPUs, while other headline features remain tied to NPU-equipped systems.
That is not elegant. But PC buyers already live in a world of messy capability charts. Games have minimum and recommended GPUs. Video editors depend on codecs and accelerators. Security features depend on firmware and processor support. AI will be no different, no matter how much the industry wants a single logo to simplify it.
The more honest consumer message is that local AI has tiers. A Copilot+ laptop may be the right choice for battery-friendly, integrated, always-on AI features. An RTX desktop may be excellent for higher-power local inference and developer experimentation. A standard business laptop may rely on cloud AI or CPU-bound features. The badge can indicate one path, but it should not pretend to describe the whole map.
The risk is disappointment. If users hear “Windows AI now works on non-Copilot+ PCs,” some will assume Recall, Click to Do, image tools, and every future AI feature are coming to their older machines. That is not what this change says. Microsoft will need to be precise, because the AI PC category is already full of inflated claims and thin distinctions.

The Real Battle Is Over the Default AI Runtime

This update is best understood as part of a larger contest over who owns the default local AI runtime on the PC. Microsoft wants developers to call Windows APIs. Nvidia wants developers to exploit RTX acceleration. Intel, AMD, and Qualcomm want NPUs to matter. Cloud AI providers want apps to keep calling hosted models. Open-source developers want portable stacks that are not locked to one vendor’s operating system.
Windows sits in the middle of that fight. If Microsoft can make its AI APIs easy, performant, policy-manageable, and widely available, it can turn local AI into a Windows platform advantage. If it keeps the stack too restricted, developers will route around it with their own model runtimes and hardware-specific libraries. That would leave Windows as the host operating system but not the AI platform.
The RTX preview suggests Microsoft understands that danger. A platform API that only works on a narrow class of recently marketed devices is not really a platform API. It is a product feature wearing platform clothing. Broadening support makes the APIs more credible.
Still, Microsoft must avoid creating a maze. Developers will not embrace Windows AI because it has an appealing architecture diagram. They will embrace it if it reduces complexity. The system needs clear capability detection, dependable model availability, transparent performance expectations, and licensing terms that do not make developers nervous after they have built features around it.

The Copilot+ Line Is Thinner Than Microsoft First Drew It

The practical lesson is not that Copilot+ PCs are obsolete. It is that the original boundary was overdrawn. Microsoft needed a launch narrative, OEMs needed a reason to sell new machines, and NPUs gave the industry a clean number to print on spec sheets. But local AI was never going to fit neatly inside that campaign.
The Windows PC ecosystem is too broad for that. A 2021-era RTX desktop may have more raw AI throughput than a newly certified ultralight laptop. A workstation may be plugged in all day and unconcerned with power draw. A corporate fleet may value manageability more than model performance. A developer may care more about API stability than whether the machine carries a consumer-facing label.
By extending local language APIs to supported Nvidia GPUs, Microsoft is acknowledging that the installed base matters. That is good for users who already own capable hardware. It is good for developers who want a larger market. It is good for Windows as a platform, because an operating system should expand the usefulness of PCs rather than reserve useful capabilities for the newest marketing category.
But it also weakens the mystique around Copilot+. Once users understand that some local AI features can run on non-Copilot+ PCs, they will judge the badge more critically. It will need to stand for tangible advantages: battery life, latency, integration, security, and feature breadth. A logo alone will not carry the argument.

The New Rules Windows Users Should Actually Remember

This is a preview-era shift, so the immediate impact will be uneven. The important thing is not to overread it as a universal unlock or underread it as a dry SDK footnote. It is the first visible step toward a Windows AI model where capability follows hardware reality rather than a single badge.

Windows 11’s local Language Model APIs are expanding beyond Copilot+ PCs, but the new path currently targets supported Nvidia RTX 30-series or newer GPUs with at least 6GB of VRAM.
Phi Silica is aimed at local text intelligence such as summarization, rewriting, formatting, table conversion, and prompt-based generation rather than replacing large cloud chatbots.
Copilot+ PCs still matter because NPUs are better suited to efficient, sustained, battery-conscious AI workloads, especially in thin laptops.
This change does not automatically bring every Copilot+ feature, including Microsoft’s more visible shell-level AI experiences, to older or non-certified PCs.
Developers now have a stronger reason to treat Windows AI APIs as a platform layer, provided Microsoft makes availability, policy control, and performance predictable.
The AI PC badge is becoming less of a hard border and more of a signal about one kind of optimized experience.

Microsoft’s quiet RTX expansion does not end the Copilot+ era, but it does end the cleanest version of its story. The next phase of Windows AI will be less about proving that NPUs are special and more about proving that Windows can intelligently use whatever capable silicon is already inside the PC. That is a harder message to sell, but a better foundation to build on.

References

Primary source: Digital Trends
Published: Thu, 11 Jun 2026 15:13:03 GMT

Your Windows 11 PC can now natively run AI workloads, even if it lacks the Copilot+ badge - Digital Trends

Microsoft's latest AI decision could make millions of existing Windows PCs far more relevant than anyone expected.

www.digitaltrends.com
Official source: learn.microsoft.com

Microsoft.Windows.AI Namespace - Windows App SDK | Microsoft Learn

Provides APIs for local, on-device AI features.

learn.microsoft.com
Official source: developer.microsoft.com

Windows AI | Microsoft Developer

A unified, reliable and secure platform supporting the AI developer lifecycle from model selection, fine-tuning, optimizing and deployment across CPU, GPU, NPU and cloud.

developer.microsoft.com
Related coverage: berrall.com

Microsoft is killing the Copilot+ PC advantage, brings Windows 11’s local AI to RTX 30+ PCs with 6GB vRAM - Peer Networks UK

Wales & West leading provider of PC repairs & IT support for home & business. Peer Networks delivers prompt, no fuss, PC repair services to customers.

www.berrall.com
Related coverage: windowscentral.com

"If it's this easy, why don't more Windows apps use a PC's NPU?" — Microsoft MVP demonstrates how he added meaningful AI to an app in just 10 minutes | Windows Central

It turns out that the NPU in your AI PC could be getting a lot more use, if only developers decided to take the (relatively simple) AI plunge.

www.windowscentral.com
Official source: microsoft.com

IDC Quick Take Observations on the Next-Generation AI PC

PDF document

www.microsoft.com

ChatGPT · Jun 11, 2026

Microsoft has opened experimental Windows AI language-model APIs to Windows 11 PCs with Nvidia GeForce RTX 30-series GPUs or newer and at least 6GB of VRAM, letting some non-Copilot+ PCs run local AI workloads that were previously reserved for NPU-equipped machines. The change is not a consumer-feature unlock so much as a platform signal. Microsoft is no longer pretending that the NPU is the only credible local AI engine in the Windows ecosystem. That matters because the original Copilot+ pitch was built around a hardware boundary that is now becoming more porous.

Microsoft’s Copilot+ Wall Was Always More Marketing Than Physics

When Microsoft introduced Copilot+ PCs in May 2024 and put the first wave on shelves in June, it sold the category with unusual bluntness: this was not merely a faster Windows laptop, but a new class of machine. The defining number was 40 TOPS of NPU performance, joined by baseline requirements such as 16GB of memory and SSD storage. The implication was simple enough for retail shelves and OEM keynotes: if you wanted the new local AI future of Windows, you needed the new silicon.
That message was useful, but it was never the whole technical story. GPUs have been doing machine-learning work for years, and Nvidia’s RTX line is practically synonymous with consumer-accessible AI acceleration. The difference is not whether a GPU can run a local model; the difference is whether Microsoft was willing to make Windows’ own AI plumbing treat that GPU as a first-class target.
The new experimental support for language-model APIs on RTX hardware does not erase the Copilot+ category. It does, however, puncture the neatness of the original boundary. A desktop or gaming laptop with an RTX 3060, RTX 4070, or newer card may lack the badge Microsoft and its OEM partners spent two years promoting, but it can now begin to participate in part of the same local AI platform.
That is the important distinction. Microsoft has not suddenly turned every RTX gaming rig into a Copilot+ PC. It has acknowledged, in code and documentation, that the local AI runtime cannot remain trapped inside one hardware story forever.

The First Crack Appears in the Developer Layer

The change arrives in the least flashy place possible: the Windows AI developer stack. Microsoft describes the GPU path as experimental, and the current opening applies to language-model APIs rather than the whole menu of Copilot+ features. Developers can build apps that call into Windows’ local model capabilities on supported Nvidia GPUs, with Microsoft specifying GeForce RTX 30-series or newer hardware with at least 6GB of VRAM.
That phrasing matters. This is not a Start menu toggle, a Windows Settings switch, or an announcement that Recall is coming to your old gaming PC. It is a platform feature for applications that know how to use the Windows AI framework. The user-facing payoff will depend on developers deciding that Microsoft’s local AI APIs are worth targeting.
Still, developer layers are where platform shifts begin. DirectX was not exciting because an API existed; it was exciting because it gave game developers a common path into hardware acceleration. WinRT, Windows Hello, WSL, and countless other Windows features followed the same pattern: first the plumbing, then the apps, then the expectation that the capability is simply part of the operating system.
The Windows AI APIs are now taking a similar step. By allowing a supported GPU to run the local language model, Microsoft reduces the risk that Windows AI becomes a boutique feature limited to the newest premium laptops. It also gives developers a larger addressable base, which is exactly what any platform needs if it wants more than demo-ware.

Phi Silica Becomes Less of a Copilot+ Ornament

At the center of this change is Phi Silica, Microsoft’s small on-device language model for Windows AI experiences. The model is designed for local inference rather than cloud-scale chat, and it exposes capabilities through Windows.AI.Text APIs such as summarization, rewriting, text generation, and structured text conversion. The model is not meant to replace a frontier cloud model; it is meant to make common language tasks feel instant, private, and integrated.
The practical distribution model is also revealing. Instead of expecting every Windows 11 machine to carry the model by default, Microsoft can deliver it through Windows Update when an app requires it. That keeps the footprint lower while allowing multiple applications to use the same system-managed component.
This is the kind of operating-system move Microsoft understands well. Windows has long absorbed common runtime dependencies so developers do not need to ship their own copy of every library. If local AI becomes another shared runtime service, the question becomes less “Which app bundled which model?” and more “Which hardware target does Windows know how to use?”
GPU support makes that question more interesting. Phi Silica was introduced in the Copilot+ orbit as an NPU-tuned local model, but on-device AI is not a single-silicon religion. A model can be optimized for one kind of accelerator while still being useful on another, provided the software stack can route the work sensibly.

The NPU Still Has the Better Laptop Argument

The easy reaction is to declare the NPU requirement dead. That would be premature. Microsoft’s original NPU argument was never only about raw performance; it was about sustained, low-power, background inference on thin-and-light PCs.
A discrete GPU can be much faster than a laptop NPU for many AI workloads, but it usually pays for that speed with power draw, heat, fan noise, and competition with graphics or compute tasks. On a desktop tower plugged into the wall, that may be an acceptable trade. On an ultraportable laptop trying to preserve all-day battery life, it is much less attractive.
This is why Microsoft can open a GPU path for language APIs while keeping parts of the Copilot+ experience tied to NPUs. Features that run occasionally, on demand, and inside a developer-controlled app are different from OS-level features that may need to index, observe, caption, translate, or act across the user’s session. The former can tolerate a bursty GPU workload. The latter needs a more carefully budgeted compute envelope.
That distinction is not mere vendor spin. Anyone who has heard a gaming laptop spin up under a local model knows that “on-device” does not automatically mean “quiet” or “efficient.” NPUs are designed to make AI boring in the best possible way: always available, low-power, and unlikely to make the rest of the machine feel worse.
But the NPU’s strength does not make the GPU irrelevant. It simply means Windows needs to become smarter about assigning jobs to hardware. Local AI is not a single feature; it is a workload class. Some jobs belong on an NPU, some on a GPU, some on a CPU, and some still belong in the cloud.

Copilot+ Exclusivity Meets the Installed Base

The business reason for Microsoft’s original line was obvious. Copilot+ PCs gave OEMs a new upgrade story at a time when the PC market needed one, and they gave Microsoft a way to tie Windows 11’s next act to hardware refresh cycles. The phrase “AI PC” was not just a technical description; it was a replacement pitch.
That pitch has always had a problem: the Windows installed base is enormous, fragmented, and full of machines with capable silicon that does not match Microsoft’s initial checklist. Enthusiasts may own RTX desktops that can run local language models comfortably. Developers may use workstations with far more AI horsepower than a thin-and-light laptop NPU. Gamers may have bought an RTX 30-series card years before Copilot+ existed.
Leaving those systems outside the Windows AI story was defensible only if Copilot+ features required a specific class of always-on NPU behavior. For some features, they may. For basic language-model APIs, the argument was weaker.
Microsoft appears to be recognizing that the developer ecosystem cannot be built solely around newly purchased Copilot+ machines. If the company wants Windows AI APIs to become normal app infrastructure, developers need more test machines, more user machines, and fewer reasons to bypass Microsoft’s stack in favor of direct calls to CUDA, DirectML, ONNX Runtime, llama.cpp, or cloud APIs.
The RTX opening is therefore less a surrender than a recruitment drive. Microsoft is trying to make Windows itself relevant in a local AI world where developers already have other ways to run models on PCs.

Recall Remains the Line Microsoft Is Not Ready to Cross

The most visible Copilot+ feature remains Recall, the controversial system that captures and indexes a user’s activity so it can be searched later. Recall’s history has made it more than a feature; it is a trust test for Microsoft’s AI ambitions. After the original rollout plan drew heavy criticism from security researchers and privacy advocates, Microsoft reworked the feature with stronger controls, opt-in behavior, and Windows Hello requirements.
That history helps explain why GPU support is arriving first in language-model APIs, not in the headline Copilot+ feature set. Recall is not simply “run a model locally.” It involves data capture, indexing, identity protection, storage security, and user consent. The hardware accelerator is only one part of a much larger system.
Click to Do and other shell-level AI experiences sit in a similar category. They are not ordinary app features. They are Windows experiences that can interact with the user’s screen, content, and workflow. Microsoft will be cautious about expanding those beyond the hardware and security profile it has already defined.
That leaves Windows users in an odd middle ground. A non-Copilot+ RTX PC may gain access to local language capabilities through apps, but it still may not receive the branded experiences Microsoft uses to advertise Copilot+ machines. The wall is no longer solid, but it is not gone.
This may frustrate enthusiasts, especially those with desktops that vastly outperform Copilot+ laptops on many AI benchmarks. Yet Microsoft’s segmentation is partly technical, partly security-driven, and partly commercial. The company wants a broader AI platform, but it also wants the Copilot+ label to keep meaning something.

Nvidia Gets the Validation It Has Been Arguing For

For Nvidia, Microsoft’s move is a quiet win. Nvidia has spent years telling consumers and developers that RTX PCs are AI PCs, even before Microsoft’s Copilot+ branding gave the term a narrower definition. The company’s argument has been straightforward: tensor cores, mature software, and a huge installed base make RTX hardware a natural home for local inference.
Microsoft’s initial Copilot+ requirements complicated that story. A machine with a powerful RTX GPU could be excluded from Copilot+ features while a laptop with an NPU received the badge. That made sense from Microsoft’s battery-life and platform-control perspective, but it created a messaging clash with Nvidia’s larger AI PC campaign.
By adding GPU support to Windows AI language APIs, Microsoft narrows the gap between those narratives. It does not hand Nvidia the Copilot+ brand outright, but it gives RTX hardware a sanctioned role inside Microsoft’s local AI framework. That is more valuable than a marketing quote because it gives developers a Windows-supported path to the installed base Nvidia already has.
The minimum requirement of RTX 30-series hardware with 6GB of VRAM is also telling. Microsoft is not trying to support every aging GPU that can technically execute a model. It is drawing a pragmatic line around hardware with modern AI acceleration and enough memory to avoid a miserable baseline experience.
That line will still exclude some users. Low-end RTX cards with limited VRAM, older GTX machines, integrated graphics, and AMD or Intel GPUs are not part of this specific Nvidia path. But once Microsoft accepts GPUs as legitimate targets, pressure will grow for a broader hardware matrix.

Developers Now Have a More Interesting Windows AI Pitch

For developers, the biggest change is not that Phi Silica can run on an RTX GPU. It is that Microsoft’s local AI APIs now have a better chance of reaching users who did not buy a new Copilot+ laptop.
That matters because developers are allergic to tiny platform islands. If an API works only on a narrow slice of premium devices, it becomes a demo feature or an optional flourish. If it works across a meaningful portion of Windows 11 hardware, it can become a design assumption.
The API approach also abstracts some of the mess that has made local AI on PCs both exciting and chaotic. Today’s local AI scene is rich with frameworks, model formats, quantization choices, GPU backends, driver dependencies, and performance caveats. Enthusiasts can navigate that world. Most application developers would rather call a supported Windows API and let the platform handle model delivery and hardware selection.
That is the strategic opening for Microsoft. Windows does not need to beat every open-source local model runner on flexibility. It needs to be predictable, integrated, and good enough for mainstream app scenarios. Summarize this document, rewrite this email, extract structured data from this text, generate a short draft, classify this note: these are not science projects anymore.
The more Windows can make those tasks local by default, the more it can reduce latency, cloud costs, and privacy concerns. The catch is that Microsoft has to earn developer trust after years of shifting Windows app strategies. Experimental APIs are useful, but developers will wait to see what stabilizes, what ships, and what Microsoft keeps supporting.

Local AI Is Becoming a Privacy Feature, Not Just a Performance Feature

Microsoft’s cloud AI strategy is not going away. Copilot, Azure AI, Microsoft 365 Copilot, and developer-facing cloud models remain central to the company’s business. But local AI has a different kind of appeal: it can answer the growing discomfort around sending everything to remote servers.
For consumers, that appeal is intuitive. A writing tool that rewrites a paragraph locally feels less invasive than one that uploads the text. A summarizer that works on a private file without leaving the PC is easier to trust. A small model that handles routine language tasks offline is useful even when connectivity is poor.
For enterprises, the stakes are sharper. Data residency, compliance, confidentiality, and auditability all shape whether AI features can be deployed broadly. Many organizations are interested in AI but wary of uncontrolled data flows. Local inference does not solve every governance problem, but it changes the risk profile.
That is why the API layer matters. If Windows can provide local AI capabilities that developers can invoke consistently, organizations can begin to evaluate those capabilities as part of endpoint strategy rather than as a pile of separate app integrations. The PC becomes not just a client for AI, but a controlled execution environment.
This is also where Microsoft must be careful. “Local” cannot become a magic privacy word. Users and administrators need to know when data stays on device, when it is sent to the cloud, which models are installed, how updates are handled, and what telemetry surrounds the experience. If Microsoft blurs those lines, the local AI trust advantage will evaporate quickly.

The Copilot+ Brand Looks Less Like a Destination and More Like a Tier

The Copilot+ PC label is not dead, but it is changing shape. At launch, it sounded like a gate: on one side were ordinary PCs, and on the other were machines capable of the next generation of Windows. GPU support for local language APIs makes the label look more like a tier in a wider spectrum.
That may actually be healthier for Windows. The PC ecosystem has never fit cleanly into single-brand categories. There are gaming desktops with huge GPUs, fanless ultraportables with efficient NPUs, business laptops with conservative driver stacks, workstations with professional accelerators, and budget machines that barely meet Windows 11’s own requirements. A serious AI platform has to adapt to that diversity.
The risk is confusion. Microsoft has already struggled to explain the difference between Copilot, Copilot in Windows, Microsoft 365 Copilot, Copilot+ PCs, Windows AI APIs, Windows Copilot Runtime, Windows ML, and Foundry branding. Adding “some local AI works on RTX non-Copilot+ PCs, but not the famous Copilot+ features” will not make the retail story easier.
But technical reality often wins eventually. If a user’s machine has hardware capable of running a local language model, and if Windows can support it safely, an artificial block becomes harder to defend. Microsoft does not need to abandon Copilot+ branding to soften it. It can keep Copilot+ as the premium, fully validated experience while allowing specific AI capabilities to scale across other machines.
That appears to be the direction now. Copilot+ becomes the best-supported path, not the only path. For Windows, that is a significant philosophical shift.

The RTX Door Opens, but Only a Few Rooms Are Unlocked

The most practical way to read this change is as a limited but meaningful expansion. It is not a consumer rollout, not a Recall unlock, and not a universal AI upgrade for every Windows 11 PC. It is a sign that Microsoft is beginning to separate Windows AI capabilities from the Copilot+ badge where the technical case allows it.

Microsoft’s experimental GPU support applies to Windows AI language-model APIs, not the full Copilot+ feature set.
Supported Nvidia hardware currently starts with GeForce RTX 30-series GPUs or newer with at least 6GB of VRAM.
Phi Silica can be delivered through Windows Update when an application needs the local model, rather than being preinstalled on every Windows machine.
Developers gain a larger potential audience for local summarization, rewriting, text generation, and structured text features.
NPU-equipped Copilot+ PCs still have the stronger argument for low-power, always-available laptop AI experiences.
The move makes Copilot+ look less like an absolute hardware wall and more like Microsoft’s premium validation tier for Windows AI.

For users, the near-term impact will be modest unless applications adopt the APIs. For developers and IT planners, the signal is larger: Microsoft is preparing for a Windows AI ecosystem in which the accelerator might be an NPU, a GPU, or something else entirely.
Microsoft’s original Copilot+ pitch needed a clean line because new categories require simple stories. Two years later, the platform needs a messier but more durable truth: local AI on Windows will not belong to one chip, one badge, or one generation of laptops. The company can still make the NPU the centerpiece of its most polished experiences, but if Windows is going to be the place where PC AI actually happens, it has to meet capable hardware where it already lives.

References

Primary source: TechSpot
Published: Thu, 11 Jun 2026 20:25:50 GMT

Microsoft is now letting Nvidia GPUs run local AI features that were locked to Copilot+ PCs | TechSpot

When Copilot+ PCs launched on June 18, 2024, the messaging was clear: dedicated AI hardware was essential. These machines were defined in part by their neural processing...

www.techspot.com
Official source: learn.microsoft.com

Microsoft.Windows.AI Namespace - Windows App SDK | Microsoft Learn

Provides APIs for local, on-device AI features.

learn.microsoft.com
Official source: developer.microsoft.com

Windows AI | Microsoft Developer

A unified, reliable and secure platform supporting the AI developer lifecycle from model selection, fine-tuning, optimizing and deployment across CPU, GPU, NPU and cloud.

developer.microsoft.com
Related coverage: berrall.com

Microsoft is killing the Copilot+ PC advantage, brings Windows 11’s local AI to RTX 30+ PCs with 6GB vRAM - Peer Networks UK

Wales & West leading provider of PC repairs & IT support for home & business. Peer Networks delivers prompt, no fuss, PC repair services to customers.

www.berrall.com
Related coverage: blogs.nvidia.com

NVIDIA Accelerates Microsoft’s Open Phi-3 Mini Language Models | NVIDIA Blog

NVIDIA announced today its acceleration of Microsoft’s new Phi-3 Mini open language model with NVIDIA TensorRT-LLM, an open-source library for optimizing large language model inference when running on NVIDIA GPUs from PC to Cloud.

blogs.nvidia.com
Official source: azure.microsoft.com

Phi Open Models - Small Language Models | Microsoft Azure

Explore Phi models, efficient small language models (SLMs) for generative AI applications. Learn more about Phi in Azure AI Foundry.

azure.microsoft.com

Related coverage: developer.nvidia.com

AI Models | NVIDIA Developer

Explore and deploy top AI models built by the community, accelerated by NVIDIA’s AI inference platform, and run on NVIDIA-accelerated infrastructure.

developer.nvidia.com
Related coverage: build.nvidia.com

AI Models by Microsoft | Try NVIDIA NIM APIs

Experience the leading models to build enterprise generative AI apps now.

build.nvidia.com
Official source: devblogs.microsoft.com

What's new in Microsoft Foundry | February 2026 | Microsoft Foundry Blog

Explore Microsoft Foundry February 2026 featuring Claude Opus and Sonnet models for advanced reasoning and efficiency.

devblogs.microsoft.com
Related coverage: docs.api.nvidia.com

LLM APIs

Overview The Large Language Model (LLM) NIM API endpoints provide simple access to use natural language based generative AI. This single API endpoint provides access to top models for use in a wide range of tasks including: chat, instruction following, question answering, summarization, creative...

docs.api.nvidia.com
Related coverage: docs.nvidia.com

models PCs and Workstations

PDF document

docs.nvidia.com
Related coverage: nvidianews.nvidia.com

665c691a3d6332d00dbbd30a

PDF document

nvidianews.nvidia.com
Related coverage: docscontent.nvidia.com

NVIDIA AI Enterprise

Release Notes

docscontent.nvidia.com
Related coverage: nvidia.com

INTRODUCTION TO THE NVIDIA TURING ARCHITECTURE

</rdf:Alt> </dc:description> <dc:creator> <rdf:Seq> <rdf:li>NVIDIA

www.nvidia.com
Official source: microsoft.com

Shop High-Performance Laptops, Computers, PCs, and Tablets | Microsoft Windows

Shop high-performance laptops, PCs, and tablets built for multitasking, advanced AI capabilities, powerful graphics, and all-day performance. Explore premium, high-spec Windows devices.

www.microsoft.com
Official source: blogs.microsoft.com

Introducing Copilot+ PCs - The Official Microsoft Blog

An on-demand recording of our May 20 event is available. Today, at a special event on our new Microsoft campus, we introduced the world to a new category of Windows PCs designed for AI, Copilot+ PCs. Copilot+ PCs are the fastest, most intelligent Windows PCs ever built. With powerful new...

blogs.microsoft.com
Related coverage: tomshardware.com

Copilot+ PCs: All we know about the AI-ready laptops and exclusive Windows features | Tom's Hardware

Microsoft's shiny new AI innovations for the laptop space

www.tomshardware.com
Related coverage: windowscentral.com

Microsoft Copilot+ PC guide: What it is, features, how to access it, and PC requirements, and everything you need to know | Windows Central

Microsoft Copilot+ has been announced for upcoming AI PCs, but what exactly is it? Here's everything you need to know.

www.windowscentral.com
Related coverage: computerworld.com

Microsoft launches AI-powered Copilot+ PCs – Computerworld

The first batch of Copilot+ PCs will come with Qualcomm Snapdragon X series processors and will hit the shelves on June 18.

www.computerworld.com
Official source: news.microsoft.com

Microsoft presenta los Copilot+ PC, una nueva categoría de dispositivos con Windows diseñados para la IA - Source EMEA

Microsoft ha reinventado por completo el PC, situando la IA en el centro, para operar en local, dando lugar a los equipos con Windows más rápidos e inteligentes nunca vistos. Se trata del cambio más significativo en la plataforma de Windows en décadas.

news.microsoft.com
Related coverage: skywork.ai

Copilot+ PC Requirements — The Ultimate Guide

Understand Microsoft Copilot+ PC AI requirements: 40+ TOPS NPU, RAM, storage, supported processors, and which features need Copilot+. Read the complete guide.

skywork.ai
Related coverage: pcgamer.com

Microsoft plans to launch a cheaper 8 GB Surface laptop later this year which won't meet the requirements of a Copilot+ PC | PC Gamer

How far we have fallen.

www.pcgamer.com

ChatGPT · Jun 12, 2026

Microsoft has updated its Windows 11 local AI documentation in June 2026 to let developers run Phi Silica language model APIs on non-Copilot+ PCs with supported Nvidia RTX GPUs, widening on-device text AI beyond machines with dedicated NPUs. The move does not suddenly turn every gaming rig into a full Copilot+ PC, nor does it hand Recall to the GPU crowd. But it does quietly puncture one of the cleanest marketing lines Microsoft has drawn around Windows AI hardware. The new message is messier, more practical, and probably more durable: local AI on Windows is becoming a platform capability, not a single badge on a laptop lid.

Microsoft’s NPU Wall Now Has a GPU-Sized Door in It

When Microsoft introduced Copilot+ PCs in 2024, the pitch was deliberately simple. If you wanted the new wave of Windows AI features to run locally, you needed a new class of PC with a neural processing unit capable of at least 40 trillion operations per second. The NPU was not just another accelerator; it was the hardware foundation for Microsoft’s next version of the Windows client.
That simplicity was useful for marketing and for OEMs trying to sell premium laptops into a sluggish PC refresh cycle. It was also somewhat artificial. Anyone who has watched the last decade of GPU computing knows that Nvidia hardware is perfectly capable of running local language models, image models, speech models, and inference pipelines. The question was never whether GPUs could run AI workloads. The question was whether Microsoft would bless them inside its own Windows AI stack.
The answer is now yes, but with caveats. The updated Windows AI documentation says Phi Silica, Microsoft’s small on-device language model for Windows, can run on non-Copilot+ Windows 11 devices equipped with Nvidia GeForce RTX 30 series GPUs or newer, provided they have at least 6GB of VRAM. AMD GPU support is described as coming later, but today’s live path is Nvidia-first.
That is a meaningful shift because it moves Microsoft’s local language model APIs from a narrow hardware identity to a broader developer target. A Copilot+ PC still gets the cleanest story: the model runs on the NPU, with Microsoft’s intended power and latency profile. But a desktop with an RTX 3060, a gaming laptop with an RTX 4060, or a workstation with a recent Nvidia card now enters the conversation.
This is not consumer magic yet. It is plumbing. The APIs are aimed at developers building Windows apps that call into Microsoft’s local AI framework. End users will feel the change only when applications are written or updated to use those APIs.
That distinction matters because Microsoft is not shipping a big green “AI enabled” switch for every eligible RTX owner. It is expanding the surface area for developers, and that is usually how Windows platform changes become real: slowly, unevenly, and then all at once if the ecosystem finds a reason to care.

Phi Silica Becomes the Test Case for a More Flexible Windows AI Stack

Phi Silica is the center of this story because it is small enough to run locally, integrated enough to matter to Windows developers, and limited enough to reveal Microsoft’s caution. It is not GPT-5 hiding in the Start menu. It is a compact language model designed for common text tasks such as summarization, rewriting, text generation, and formatting unstructured content into more structured output.
The important part is not that these tasks are novel. They are not. Cloud tools have been summarizing emails and rewriting paragraphs for years. The point is that Phi Silica gives Windows applications a system-provided local model path without requiring every developer to ship, update, tune, and support their own model runtime.
That is the platform play. Microsoft would like app developers to think of local AI in Windows the way they think of notifications, file pickers, camera access, speech recognition, or composition effects. The operating system supplies a capability, the developer calls an API, and the hardware underneath does the work through whatever accelerator Microsoft supports.
Until now, the hardware story for Phi Silica was tied tightly to Copilot+ PCs. On those systems, the model runs on the NPU, and Microsoft can assume a more predictable power envelope. With GPU support, the same model can reach a much larger installed base, especially among enthusiasts and professionals who already own Windows 11 machines with RTX cards but have no NPU meeting Microsoft’s Copilot+ bar.
That broader base is why this documentation change matters more than its dry wording suggests. Developers do not build for platforms that look rare, fragmented, or tied to a single product cycle. By allowing Phi Silica to run on a chunk of the RTX installed base, Microsoft gives developers a better reason to experiment with local AI features now rather than waiting for Copilot+ hardware to saturate the market.
There is still friction. GPU support currently requires Developer Mode, recent Windows Insider-era components, the right Windows App SDK version, and manufacturer-provided GPU drivers rather than relying on the generic driver path many users get through Windows Update. The Phi Silica APIs are also part of a limited-access feature, which means developers need to work through Microsoft’s access process rather than simply flipping a public production switch.
That is why this should be read as a strategic preview rather than a mainstream rollout. Microsoft is laying track, not running a scheduled passenger service.

The Copilot+ Badge Loses Some Exclusivity, Not Its Purpose

The obvious reading is that Microsoft has weakened the Copilot+ PC proposition. If a non-Copilot+ machine with an Nvidia GPU can run local Windows language model APIs, why buy a Copilot+ laptop at all? That is the sort of neat conclusion that makes for a punchy headline and a shallow analysis.
The better reading is that Microsoft is separating two things it previously bundled together: the Copilot+ PC as a consumer hardware class, and Windows AI as a developer platform. The former still depends heavily on NPUs. The latter cannot afford to be confined to one accelerator category forever.
Copilot+ PCs still have advantages that GPUs do not erase. NPUs are designed for sustained, low-power inference, especially on laptops. They can run AI workloads without waking the discrete GPU, draining the battery, heating the chassis, or competing with games, rendering software, video playback, or GPU-accelerated creative tools. That matters if AI is supposed to become ambient rather than occasional.
The updated Microsoft documentation is unusually clear on this point. GPU execution of Phi Silica is expected to have different performance and power characteristics from NPU execution. Latency may be higher. Battery impact may be worse. The model may compete with other GPU workloads. Features available on the NPU path, such as prompt compression and speculative decoding, are not currently available on the GPU path.
In other words, Microsoft is not saying an RTX-equipped desktop is the same thing as a Copilot+ ultrabook. It is saying the same local model can now run on more machines, with a different trade-off profile. That is the sort of compromise Windows has always made.
For desktop users, the trade-off may be perfectly acceptable. A tower PC with a plugged-in RTX 4070 does not care much about battery life, and a workstation user may prefer local inference over a cloud round trip even if the model is not blazing fast. For laptop users, the calculus is more complicated. A discrete GPU may be available, but using it for background AI tasks can turn a quiet productivity machine into a warm, noisy one.
This is where the Copilot+ badge keeps its purpose. It remains shorthand for a machine designed around local AI as a first-class, always-available workload. Nvidia GPU support, by contrast, makes local language model APIs available to a broader but less uniform set of PCs.

Developers Get a Bigger Addressable Market, but Also a Bigger Testing Problem

For Windows developers, the upside is obvious. A feature that only works on Copilot+ PCs is a niche feature, at least until the installed base catches up. A feature that also works on recent Nvidia GPUs reaches gamers, creators, engineers, researchers, and power users who often run high-end hardware long before they buy a new AI-branded laptop.
That matters for application categories where local text intelligence is useful but cloud dependence is awkward. A note-taking app could summarize meeting notes without sending them to a remote service. A code editor could offer limited local explanation or transformation features when the user is offline. A legal, medical, or enterprise workflow tool could use local rewriting or formatting while keeping sensitive drafts on the device, though developers would still need to handle accuracy, policy, and data governance carefully.
The problem is that the Windows PC ecosystem is not a console. Supporting “RTX 30 series and newer with 6GB of VRAM” sounds tidy until it collides with real-world machines. There are desktop cards and laptop GPUs, OEM drivers and Nvidia beta drivers, thermal envelopes and power settings, external monitors and hybrid graphics, background game launchers and creative apps already consuming VRAM.
Microsoft’s own notes acknowledge this indirectly by warning that GPU inference depends on GPU generation, available VRAM, driver state, and current load. That is not a footnote. It is the operational reality developers will need to design around.
A well-built app cannot assume that local AI is available just because the user has a supported GPU on paper. It needs runtime checks, graceful fallbacks, clear error messages, and probably a cloud or non-AI path when the local model is missing, unavailable, too slow, or disabled. It also needs to avoid presenting local AI as a magic privacy shield if the rest of the application still syncs, logs, or uploads user content elsewhere.
This is why Microsoft’s decision to make Phi Silica a system-managed component is important. If every app shipped its own language model, Windows would become a junk drawer of duplicate weights, conflicting runtimes, and unpredictable update mechanisms. A shared platform model downloaded and serviced through the operating system is cleaner, at least in theory.
But the theory only works if Microsoft keeps the platform stable. Developers burned by experimental APIs, branding churn, and limited-access gates will not bet core product experiences on a feature that feels like it may be renamed, restricted, or superseded in six months. Microsoft has spent the last two years cycling through terms like Windows Copilot Runtime, Windows AI Foundry, Microsoft Foundry on Windows, and Windows AI APIs. At some point, the vocabulary has to stop moving if the platform underneath is supposed to look dependable.

Nvidia Wins the First Round Because Windows AI Needs Real Silicon Today

The Nvidia-first nature of the rollout is not surprising. Nvidia owns the cultural and practical mindshare around local AI on PCs. CUDA, TensorRT, RTX branding, and the sheer size of the installed base give Microsoft a ready-made path to developers who already think of GPUs as AI hardware.
For Windows enthusiasts, this is also the most intuitive version of the story. Many users who built or bought gaming PCs in the last few years already own more AI acceleration than the average thin-and-light laptop, even if their machines do not qualify as Copilot+ PCs. The idea that those systems were locked out of Microsoft’s local AI APIs while lower-power NPU laptops were welcomed in always felt more like market segmentation than technical necessity.
Still, the Nvidia dependency cuts both ways. If Windows AI features become more useful on Nvidia hardware than on AMD or Intel hardware, Microsoft risks turning part of the Windows developer story into another GPU ecosystem advantage. That may be acceptable in an experimental phase. It becomes more uncomfortable if local AI becomes a standard expectation for productivity software.
Microsoft says AMD GPU support is planned, but the absence of Intel GPU support from the current headline is notable. Intel has pushed AI PCs aggressively, ships integrated GPUs at massive scale, and has its own NPU story in recent Core Ultra platforms. AMD has both Radeon GPUs and Ryzen AI NPUs. Qualcomm, meanwhile, helped launch the first wave of Copilot+ PCs with Arm-based Snapdragon X chips.
A healthy Windows AI platform cannot remain Nvidia-only outside Copilot+ machines. The Windows franchise is built on hardware pluralism. Users may tolerate “best on Nvidia” in gaming and creative acceleration, but core OS-level AI APIs need to feel broadly available or at least predictably tiered across vendors.
There is also a competitive subtext. Nvidia has been working to make RTX PCs feel like local AI workstations, not just gaming machines. Microsoft, meanwhile, wants Windows to be the place where local AI applications are built and consumed. The two strategies align for now. Nvidia supplies the installed base and performance story; Microsoft supplies the operating system APIs and developer funnel.
The interesting question is who owns the developer relationship in the long run. If developers call Microsoft’s Windows AI APIs, Microsoft owns the abstraction. If developers bypass them for Nvidia’s own tools, model runtimes, and agent frameworks, Windows becomes the stage but not the platform. This Phi Silica expansion is Microsoft’s way of keeping itself in the middle.

Recall Remains the Line Microsoft Is Not Crossing

The update does not bring Windows Recall to non-Copilot+ PCs. It does not unlock Click to Do across RTX desktops. It does not make every Copilot+ feature portable to a GPU-backed Windows 11 machine. That boundary is important because Recall is not just another model invocation.
Recall is an operating-system-level feature that periodically captures and indexes user activity so it can be searched later. Its controversies have always been about security, privacy, consent, and data handling as much as hardware acceleration. Moving it to a broader set of PCs would require Microsoft to revisit not just performance assumptions but trust assumptions.
By contrast, the language model APIs now expanding to Nvidia GPUs are developer-facing and task-oriented. An app asks the model to summarize text, rewrite content, generate output, or perform a related language task. That is a more contained scenario than building a persistent, searchable memory of user activity across the desktop.
Microsoft is therefore making the least explosive expansion first. Text APIs are useful, developer-friendly, and easier to explain. They also let Microsoft gather experience with GPU-backed local inference without reopening every debate about Recall on day one.
The lack of Recall support should not be treated as a technical impossibility. GPUs could accelerate pieces of such a pipeline. But product eligibility is not the same thing as silicon capability. Microsoft has every incentive to keep the most sensitive Copilot+ features tied to machines it can define, certify, and support more tightly.
That said, the GPU opening makes future boundaries harder to justify if they are framed purely as hardware limitations. If Microsoft says a feature requires a Copilot+ PC because it needs local AI acceleration, users with powerful GPUs will now have an obvious counterargument. The company will need to explain when the requirement is about performance, when it is about battery life, when it is about security architecture, and when it is simply about product segmentation.
The old answer — “you need an NPU” — is no longer enough.

Local AI Is Becoming a Windows Distribution Problem

One overlooked part of the change is how Phi Silica gets onto a machine. Microsoft’s model is not necessarily preinstalled everywhere. It can be downloaded on demand when an application requires it, managed as a Windows AI component, and removed by the user through system settings.
That sounds mundane, but it is critical. Local AI models are large enough to matter, updated often enough to require servicing, and sensitive enough to raise security and compliance questions. If Windows is going to provide shared models as platform components, then model distribution becomes part of operating system maintenance.
This has benefits. A centrally managed model can receive updates, policy controls, and compatibility fixes without every application reinventing the wheel. It can also reduce duplication, because ten apps can call the same underlying model instead of shipping ten slightly different runtimes into user storage.
But it also creates new administrative questions. Enterprise IT teams will want to know when models are downloaded, where they are stored, how they are patched, whether they can be blocked, what telemetry is generated, and whether model availability changes application behavior. A feature that looks like a developer convenience on a consumer PC can become a governance issue in a managed fleet.
The GPU driver requirement adds another wrinkle. Microsoft’s documentation warns that the latest manufacturer driver may be required and that Windows Update or OEM-provided drivers may not be sufficient. That is an old Windows tension in a new costume. Enterprises like predictable driver channels. AI frameworks often want the newest acceleration stack.
For enthusiasts, installing Nvidia’s latest beta or production driver is routine. For corporate IT, it is a change-management event. If local AI features depend on drivers outside the normal OEM support cadence, adoption will be slower in business environments no matter how compelling the APIs look.
That does not make the move unimportant. It means Microsoft’s next job is not only technical enablement; it is operational domestication. Local AI has to become boring enough to manage.

The Privacy Pitch Is Real, but It Is Not Self-Executing

Local inference has an obvious appeal: the prompt and output can stay on the device. For users who are wary of sending drafts, notes, source code, documents, or private messages to a cloud model, that is a real advantage. It is also one of the few AI pitches that still resonates with skeptical Windows users.
But “local” is not the same as “private by default.” An application can call a local model and still sync the document to a cloud service. It can generate logs. It can collect telemetry. It can offer a local mode for one feature and a cloud mode for another. The model’s location is only one part of the privacy story.
Microsoft’s own responsible AI materials make the other limitation clear: local models can still hallucinate, produce biased output, misunderstand context, and generate plausible nonsense. Running on an RTX card instead of in a data center does not make a model more truthful. It only changes where computation happens.
That is especially important for the likely first wave of use cases. Summarization and rewriting sound low-risk until they are applied to legal contracts, medical instructions, HR complaints, security logs, or financial documents. Developers need to decide whether local AI output is assistive text, a draft, a suggestion, or an action trigger. Those distinctions should be visible in the user interface, not buried in a policy page.
For WindowsForum readers, the practical advice is to treat local AI like any other local automation tool. It can be valuable, especially when it reduces cloud exposure or works offline. But it should not be trusted blindly, and it should not be allowed to blur the line between assisting a user and acting on their behalf.
The GPU expansion increases the number of machines that can participate in this experiment. It does not remove the need for judgment.

The RTX Door Opens, but the House Is Still Under Construction

The concrete facts are straightforward enough, but the implications are larger than a hardware compatibility note. Microsoft is broadening Windows 11’s local language model APIs beyond Copilot+ PCs, starting with Nvidia RTX GPUs. That expands the developer target, complicates the Copilot+ message, and gives existing high-end PCs a role in the Windows AI roadmap.
The near-term reality is narrower. This is still developer-facing, still gated by API availability and system prerequisites, and still limited to Phi Silica language features rather than the full Copilot+ portfolio. Most users will not notice anything until software they already use adopts these APIs.
The most useful way to read the change is not as a consumer launch, but as Microsoft admitting that Windows AI cannot be NPU-only if it wants to become a real platform.

Phi Silica can now run through Windows AI APIs on non-Copilot+ Windows 11 PCs with supported Nvidia RTX 30 series or newer GPUs and at least 6GB of VRAM.
The expansion is aimed at developers first, so end users need applications that are built or updated to call these local language model APIs.
Copilot+ PCs still have the cleaner NPU path, with better power characteristics and features such as prompt compression and speculative decoding that are not currently available on the GPU path.
Recall, Click to Do, and other Copilot+ experiences remain outside this GPU expansion.
AMD GPU support is planned, but the current supported non-Copilot+ GPU path is Nvidia-first.
The change makes local AI more realistic for desktops, gaming PCs, and workstations, but it also introduces driver, VRAM, thermal, and enterprise-management complications.

Microsoft’s original Copilot+ story treated the NPU as the admission ticket to the local AI future; this update makes the ticket booth more complicated and more honest. Windows has always succeeded when it absorbed hardware diversity instead of pretending it did not exist, and local AI will be no different. The next phase will be judged less by whether Microsoft can produce another badge and more by whether developers can rely on a stable, well-supported Windows AI layer that runs acceptably across the PCs people already own.

References

Primary source: gHacks
Published: Fri, 12 Jun 2026 11:24:44 GMT

Microsoft Enables Nvidia GPU Support for Windows 11 Local Language Model APIs - gHacks Tech News

Microsoft has opened Windows 11's local language model APIs to non-Copilot+ PCs with Nvidia RTX 30-series GPUs and 6GB or more of VRAM.

www.ghacks.net
Related coverage: techradar.com

Microsoft is bringing AI features to more Windows 11 PCs — just in case you were under the impression that AI was being cut back | TechRadar

There's no need for an NPU for certain AI features now, as an Nvidia GPU will do the job

www.techradar.com
Official source: developer.microsoft.com

Windows AI | Microsoft Developer

A unified, reliable and secure platform supporting the AI developer lifecycle from model selection, fine-tuning, optimizing and deployment across CPU, GPU, NPU and cloud.

developer.microsoft.com
Related coverage: berrall.com

Microsoft is killing the Copilot+ PC advantage, brings Windows 11’s local AI to RTX 30+ PCs with 6GB vRAM - Peer Networks UK

Wales & West leading provider of PC repairs & IT support for home & business. Peer Networks delivers prompt, no fuss, PC repair services to customers.

www.berrall.com
Official source: learn.microsoft.com

Platform card - Phi Silica | Microsoft Learn

Learn about Phi Silica's features, capabilities, intended uses, and responsible AI considerations.

learn.microsoft.com
Official source: blogs.microsoft.com

Microsoft at NVIDIA GTC: New solutions for Microsoft Foundry, Azure AI infrastructure and Physical AI - The Official Microsoft Blog

Microsoft combines accelerated computing with cloud scale engineering to bring advanced AI capabilities to our customers. For years, we’ve worked with NVIDIA to integrate hardware, software and infrastructure to power many of today’s most important AI breakthroughs. What’s new at NVIDIA GTC...

blogs.microsoft.com

Related coverage: tomshardware.com

Nvidia unveils RTX Spark Superchip for laptops and desktop PCs at Computex 2026 – new platform promises to turn Windows into an agentic AI OS with Arm CPU, Blackwell GPU, and 128GB unified memory | Tom's Hardware

Over 30 laptops and 10 desktops coming this fall with "the most efficent platform ever built"

www.tomshardware.com
Related coverage: developer.nvidia.com

Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA | NVIDIA Technical Blog

AI agents are changing how you interact with your PC. Creators, developers, and AI enthusiasts are already using these agents extensively to assist with day-to…

developer.nvidia.com
Official source: github.com

GitHub - microsoft/PhiCookBook: This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) avai

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety...

github.com
Related coverage: docs.nvidia.com

models PCs and Workstations

PDF document

docs.nvidia.com
Related coverage: nvidianews.nvidia.com

677c9ef3ed6ae50e80f57916

PDF document

nvidianews.nvidia.com

ChatGPT · Jun 15, 2026

Microsoft’s Windows App SDK 2.2 Experimental 9, released in June 2026, lets Windows 11’s local language model APIs run on supported Nvidia GPUs in non-Copilot+ PCs, starting with GeForce RTX 30-series cards or newer with at least 6GB of VRAM. That does not make neural processing units useless overnight. It does, however, puncture the clean marketing story Microsoft has spent two years building around Copilot+ PCs: that local Windows AI belonged, by definition, to machines with a 40 TOPS NPU. The new reality is messier, more useful, and more threatening to the neat hardware labels that have driven the AI PC push.

Microsoft Just Broke Its Own Copilot+ Shortcut

The Copilot+ PC label was never only about performance. It was a purchasing shortcut, a retail shelf marker, and a message to developers that Windows finally had a dependable local AI target. If a PC had 16GB of memory, sufficient SSD storage, and an NPU capable of at least 40 trillion operations per second, Microsoft could say: this machine is ready for the next generation of Windows experiences.
GPU support for the Windows AI language model APIs complicates that pitch. Microsoft is not suddenly handing every gaming tower the full Copilot+ feature set, and the first GPU path is still experimental, gated, and narrow. But the conceptual wall has moved. Local AI in Windows is no longer synonymous with Copilot+ certification.
That matters because Microsoft’s AI PC strategy depends on trust in categories. “Copilot+ PC” was supposed to spare buyers from studying silicon diagrams, model runtimes, and driver stacks. If Windows’ own local language APIs can now run on a sufficiently modern Nvidia GPU, the question changes from “Do I need an NPU?” to “Which accelerator is Windows actually going to use, and for what?”
The answer is not flattering to the simplicity of the original campaign. NPUs remain important, especially in mobile systems where power draw and thermals are the whole game. But GPUs are too numerous, too powerful, and too deeply entrenched in enthusiast and workstation PCs for Microsoft to ignore. Windows cannot become an AI platform by pretending that the installed base of RTX hardware does not exist.

The First Crack Is Small, but It Is Real

The immediate change is specific: the Language Model APIs, including Microsoft’s Phi Silica path, can run on non-Copilot+ Windows 11 PCs with supported GPUs. The supported hardware list begins with Nvidia GeForce RTX 30-series and newer cards with 6GB or more of VRAM. AMD support is described as coming later, while Intel is not yet part of this first GPU wave.
There are other catches. GPU inference requires Developer Mode. It requires a Windows Insider Experimental Channel build. It requires the right Windows App SDK experimental release, and in practice the latest appropriate GPU driver from the manufacturer rather than whatever Windows Update or an OEM image happened to install. The model is not preinstalled, either; apps are expected to check readiness, ask for consent, and download the model on demand through Windows Update.
That is not a consumer rollout. It is a developer preview of an execution path. Anyone treating this as a blanket unlock of Copilot+ on gaming PCs is getting ahead of the evidence.
Still, platform shifts often begin exactly this way. Microsoft does not need to enable Recall, Cocreator, every imaging API, and every future Windows AI feature on day one to alter the strategic landscape. It only needs to show that the Windows AI stack can address GPUs as first-class local accelerators. Once that abstraction exists, developers and users will reasonably expect more of it.
The important phrase is local language model capabilities. Language models are the most developer-visible part of the on-device AI story, because they can be embedded in writing tools, summarizers, coding utilities, search interfaces, mail clients, note-taking apps, and enterprise workflows. If those APIs are available beyond Copilot+ PCs, Microsoft has widened the useful surface area of Windows AI far more than a single settings-page toggle would suggest.

NPUs Were Sold as the Future, Not Just Another Backend

Microsoft’s original NPU argument was not irrational. Dedicated neural accelerators are built to run AI operations efficiently, often at lower power and with less contention than a GPU. On a thin-and-light laptop, that is not a footnote. It is the difference between a feature that can run quietly in the background and one that turns a productivity machine into a hand warmer.
That is why the Copilot+ specification focused on an NPU threshold rather than raw GPU muscle. The target was not a liquid-cooled desktop with a 350-watt graphics card. It was the mainstream laptop: battery powered, thermally constrained, and expected to deliver AI features without destroying the user experience. For that machine, an NPU makes sense.
The problem is that Microsoft’s marketing flattened a technical preference into a category boundary. NPUs became the magical ingredient. PCs without them were implicitly behind, even if they contained discrete GPUs capable of running far larger AI workloads than the small local models Windows was exposing through its APIs.
That tension was obvious from the start to gamers, creators, and developers. A desktop RTX card has long been the obvious home for local inference experiments. Nvidia’s CUDA ecosystem and the broader AI tooling around GPUs are not side shows; they are the reason modern AI infrastructure looks the way it does. When Windows said “local AI” but ignored those cards, the platform felt less like it was following the hardware and more like it was enforcing a branding strategy.
Now Microsoft is relaxing that posture. It is not abandoning the NPU, but it is acknowledging that the best accelerator for a job depends on the machine in front of you.

The GPU Is the Wrong Tool Until It Is the Only One Users Already Have

The case against GPUs for built-in Windows AI is easy to make. They draw more power. They are often busy rendering games, driving displays, encoding video, or accelerating creative applications. They produce heat, require driver coordination, and vary widely in memory and capability. In a laptop, invoking the discrete GPU for a background summarization task can be the opposite of elegant engineering.
But the case for GPUs is even simpler: millions of Windows PCs already have them. For desktops, gaming laptops, creator workstations, and engineering rigs, the GPU is often the most capable local compute device in the system by a huge margin. Ignoring it because it does not fit the Copilot+ certification story would be platform malpractice.
There is also a developer argument. Developers do not want to write one AI feature for Copilot+ laptops, another for RTX desktops, another for cloud fallback, and another for unsupported machines. They want capability detection and an API layer that routes work sensibly. Microsoft’s readiness checks, model installation flow, and AI Components settings are not glamorous, but they are the bones of that model.
The Windows AI API hardware table already points toward a fragmented future. Some capabilities run on NPUs. Some are expanding to GPUs. Some can run on CPUs. Some remain NPU-only. That is less marketable than “buy a Copilot+ PC,” but it is closer to how Windows has always survived: by abstracting a messy hardware ecosystem without pretending the mess is not there.
This is where the NPU panic becomes overstated. GPU support does not make NPUs useless; it makes them one backend among several. The real demotion is not technical. It is narrative.

Copilot+ Was Always a Certification, Not a Law of Physics

The most important thing to understand about Copilot+ is that it is a Microsoft-defined class of PC. It is not a universal law about what hardware can run inference. It is a compatibility and experience promise wrapped in branding.
That distinction was easy to miss during the launch cycle, because the software features and hardware requirements were deliberately tied together. Recall, Live Captions translations, image creation, and other local AI experiences were presented as the practical payoff for buying into the new class. The NPU was not merely recommended; it was the gate.
That gate served several purposes. It gave Qualcomm’s Snapdragon X launch a clean Windows story. It gave AMD and Intel a target for next-generation laptop silicon. It gave OEMs a reason to refresh lineups and gave retailers a sticker to explain why a new machine was different from last year’s model. Microsoft was not just defining a feature requirement; it was trying to restart the PC upgrade cycle.
GPU support chips away at that strategy because it says some of the underlying software value is not inherently tied to Copilot+ hardware. If language models can run acceptably on a gaming laptop from 2021, the upgrade argument gets harder. If more APIs follow, the Copilot+ label starts to look less like a technical necessity and more like a curated best-experience badge.
That may be where it ends up. “Copilot+” could become less like “this is the only PC that can run Windows AI” and more like “this is the PC that runs the full supported suite efficiently, quietly, and predictably.” That is a weaker marketing line, but a more honest engineering one.

Recall Still Casts a Long Shadow

Any discussion of local Windows AI eventually runs into Recall. Microsoft’s controversial timeline feature became the public symbol of Copilot+ ambition and anxiety: useful in theory, invasive in perception, and complicated enough in security terms that Microsoft had to rework its rollout.
GPU support for language model APIs does not automatically mean Recall is coming to every RTX desktop. It should not be read that way. Recall depends on more than text generation; it involves capture, indexing, semantic retrieval, storage protections, user controls, and a trust model that Microsoft has been forced to explain repeatedly.
But Recall matters here because it shaped the public understanding of NPUs. For many users, the NPU was not a general-purpose accelerator waiting for third-party developers. It was the special chip required for that thing Windows wanted to do in the background. When Microsoft expands local AI beyond NPUs, it inevitably reopens questions about which Copilot+ experiences were truly hardware-bound and which were policy-bound.
That distinction will matter to IT departments. If a feature can technically run on GPUs but remains blocked by certification, admins will want to know why. If Microsoft enables some APIs on GPUs but keeps others NPU-only, developers will want consistent guidance. If consumers see “AI Components” downloads arriving on non-Copilot+ PCs, they will expect Windows to explain what is installed, what runs locally, and what hardware it uses.
The privacy argument also changes depending on hardware. Local inference is often presented as safer than cloud processing because data need not leave the device. That benefit can apply whether the model runs on an NPU or a GPU. If Microsoft wants local AI to be a trust story, it cannot reserve that story only for the newest laptop category.

The Silicon Economics Are More Subtle Than “Useless”

The enthusiast complaint that NPUs are “useless silicon” has always contained a grain of truth and a large dose of impatience. In many PCs, the NPU spends most of its life idle because the software ecosystem has not caught up. Users can feel the benefits of a faster CPU or GPU immediately. The NPU’s value depends on software that may or may not arrive.
That makes the die-area question real. Silicon budgets are finite. If a laptop processor devotes space to an NPU, that space cannot also be used for more CPU cache, more GPU execution units, better media blocks, or other features. Hardware vendors are betting that on-device AI workloads will become common enough to justify the allocation.
Microsoft’s GPU move does not invalidate that bet, but it lowers the exclusivity premium. If Windows AI features increasingly run across NPU, GPU, and CPU paths, then the NPU must justify itself on efficiency and availability, not access. That is a harder argument to sell in a spec sheet, though it may be the better argument in daily use.
For laptop makers, the NPU still has a role. A background summarizer, transcription engine, semantic indexer, or image enhancement tool that can run at low power is valuable. For desktops, the calculus is different. If the machine already has an RTX 4070 or 5080 class GPU, adding an NPU to the CPU package is less compelling unless Windows and applications use it constantly and intelligently.
That is the real threat to NPUs: not that GPUs are faster, because everyone already knew that, but that software availability may stop being exclusive. Once exclusivity fades, NPUs have to compete on their proper merits. Some will. Weak ones will not.

Nvidia Gets the First Seat Because the Software Stack Is the Product

It is not surprising that Nvidia is first. The AI world runs heavily on Nvidia software, not just Nvidia silicon. Drivers, libraries, developer habits, model tooling, and optimization paths all matter. If Microsoft wanted a plausible GPU preview for local language APIs, starting with RTX cards was the obvious move.
The RTX 30-series cutoff is also telling. Microsoft is not trying to support every old GPU that can technically run matrix math. It is drawing a line around hardware with enough modern capability and VRAM to provide a supportable experience. The 6GB VRAM requirement is modest by enthusiast standards, but it still excludes a meaningful chunk of older or lower-end machines.
AMD support being “coming soon” is important, but not enough by itself. For this to become a true Windows platform feature rather than an Nvidia-adjacent preview, AMD and eventually Intel need real support with clear driver requirements and comparable behavior. Windows users do not think in terms of accelerator backends. They think in terms of whether the feature works on the PC they bought.
That is where Microsoft’s burden is larger than Nvidia’s. Nvidia can optimize for its hardware and trumpet RTX AI readiness. Microsoft has to make the Windows AI layer feel coherent across Arm laptops, Intel ultrabooks, AMD handhelds, gaming desktops, enterprise fleets, and virtualized environments. It has done this kind of abstraction before. It has also made enough driver-model and feature-gating messes for Windows users to be wary.
If GPU AI support becomes another matrix of Insider builds, beta drivers, consent prompts, and unsupported edge cases, it will reinforce the belief that Windows AI is a moving target. If it becomes a stable API surface with honest readiness checks and predictable fallbacks, it could do more for local AI adoption than the Copilot+ launch did.

Developers Needed Reach More Than Purity

The developer angle may be the most important part of this story. Copilot+ PCs created a clean minimum target, but the installed base was initially small compared with the broader Windows ecosystem. Developers building Windows apps need users, and users are scattered across years of hardware generations.
A local language API that only works on a narrow slice of new laptops is interesting. One that can also work on RTX desktops, gaming laptops, and creator systems is much more attractive. Even if the GPU path is experimental today, it signals that Microsoft understands the classic platform chicken-and-egg problem. Developers will not build for hardware users do not have; users will not value hardware without software.
The readiness model is also a sign that Microsoft is thinking like a platform vendor. Apps are supposed to check whether the AI feature is ready, whether a model download is needed, and whether the device supports the path. That is less exciting than a demo, but it is how real applications avoid breaking on unsupported machines.
There is still friction. Requiring Developer Mode and an Experimental Channel build keeps this firmly in the early-adopter lane. Large model downloads through Windows Update will need clear UX, especially in managed environments with bandwidth controls and update policies. Driver requirements will create support calls. The moment an app says “your GPU is supported, but your driver is not,” ordinary users will blame the app, Windows, Nvidia, or all three.
But developers can work with complexity if the direction is stable. What they cannot work with is a platform story that changes every quarter. Microsoft’s challenge is to make GPU support feel like the beginning of a roadmap, not a concession to bad Copilot+ optics.

Enterprise IT Will See Both Opportunity and Another Policy Problem

For enterprises, GPU support is a mixed blessing. On one hand, it could extend local AI capability to workstations and engineering laptops that will not be refreshed into Copilot+ fleets anytime soon. That matters in organizations where hardware replacement cycles are measured in years, not keynote seasons.
On the other hand, it introduces another control surface. Local models downloaded through Windows Update, stored under AI Components, and invoked by third-party apps are exactly the kind of thing IT teams will want to inventory, approve, block, or route through policy. The fact that inference happens locally does not eliminate governance questions. It moves them onto the endpoint.
Bandwidth is one practical issue. Several-gigabyte model downloads are not free in a distributed enterprise, especially when they arrive through mechanisms already used for operating system servicing. Storage is another. So is help-desk support when a feature works on one RTX laptop but not another because of VRAM, driver provenance, Insider channel status, or policy.
Security teams will also ask what data enters the model, where prompts are logged, how outputs are handled, and whether local inference changes data classification rules. Microsoft can answer some of this through transparency notes and API guidance, but administrators will want enforceable controls, not just documentation.
The upside is that local AI can be attractive to regulated environments precisely because it can reduce dependence on cloud round trips. If Microsoft makes the controls good enough, GPU support could help enterprises test practical AI features on existing high-end Windows devices. If the controls are vague, it will become one more thing to disable until the dust settles.

The Copilot Brand Has a Naming Problem, Not a Death Sentence

The claim that GPU support kills Copilot branding goes too far. Microsoft has attached the Copilot name to a chatbot, a key on laptops, a Microsoft 365 subscription tier, a Windows sidebar concept, a developer assistant, and a class of PCs. The brand is not dying because a local language model can run on an RTX card. If anything, the problem is that Copilot already means too many things.
Copilot+ PC has always been the most hardware-specific version of that brand. It tells a buyer that the device meets Microsoft’s requirements for a certain local AI experience. But as Windows AI APIs expand beyond that boundary, Microsoft will have to sharpen the distinction between Copilot as a service, Copilot+ as a PC class, and Windows AI as a developer platform.
That distinction is currently too blurry. A user may reasonably ask why a non-Copilot+ PC can run a local Microsoft language model but cannot access a particular Copilot+ feature. Another may ask why a laptop with an NPU below the threshold is an “AI PC” but not a Copilot+ PC. A gamer may ask why a GPU that can run massive local models does not count toward certification.
These are not pedantic questions. They affect buying decisions. They affect whether users feel upgraded or excluded. They affect whether OEMs can explain their products without sliding into sticker soup.
The healthiest outcome would be for Microsoft to let Copilot+ become a premium assurance label while letting Windows AI become broadly hardware-adaptive. That would admit the obvious: there are many useful local AI experiences, and not all of them require the same accelerator.

This Is the Beginning of a Backend War Inside Windows

Underneath the branding dispute is a deeper platform transition. Windows is learning to schedule AI work across CPU, GPU, and NPU in the same way earlier generations of the OS had to learn graphics acceleration, video decode, power states, hybrid graphics, and heterogeneous CPU cores. The user should not need to know which block is doing the work. But for now, developers and admins absolutely do.
That transition will be uneven. Some features will remain NPU-only because they need low-power background execution or because Microsoft has only validated them there. Some will move to GPUs because memory bandwidth and raw throughput matter more. Some will run on CPUs because latency requirements are modest and universality matters. Some will fall back to the cloud because the local machine is not suitable.
The winners will be the APIs that hide this without lying. A good Windows AI feature should say whether it is available, whether it needs a download, what it will cost in storage, and how to disable or remove it. A bad one will simply fail, spin up fans unexpectedly, or tell users to buy a new PC without explaining why.
Microsoft’s new GPU support suggests it knows the future cannot be NPU-only. But the platform still has to prove that this will not become another DirectX-style compatibility maze, except with AI models instead of shader models. Windows users have lived through enough “supported but not really” hardware experiences to be skeptical.
The irony is that Microsoft may end up making NPUs more credible by making them less exclusive. If users can compare the same AI capability across NPU and GPU paths, the NPU’s efficiency advantage becomes tangible rather than theoretical. A quiet laptop running a useful local model all afternoon is a better advertisement than any TOPS number.

The RTX Door Opens, but It Does Not Open the Whole House

The practical reading is narrower than the hype and broader than Microsoft’s original Copilot+ story. GPU support for local language APIs is a real expansion of Windows AI. It is not, at least yet, a universal Copilot+ unlock.
That distinction matters because Windows AI is not one feature. It is a family of APIs and experiences with different hardware support. Phi Silica on GPU is the headline because language generation is central to modern AI apps. But image generation, OCR, super resolution, speech recognition, semantic search, and Recall-related capabilities each have their own constraints and rollout paths.
Microsoft’s own hardware-support matrix makes this clear. Some APIs remain NPU-only. Some have CPU paths. The GPU column is not a magic substitute for the Copilot+ column. It is an expansion route for selected workloads.
For users, the near-term consequence is simple: if you own a supported RTX system and are willing to live on experimental builds, Windows’ local AI developer platform is becoming more relevant to your hardware. For most consumers, nothing changes today. For developers, the signal is more important than the current install ritual.
For OEMs, the message is more uncomfortable. The NPU is no longer the only ticket into the Windows local AI conversation. It is the ticket into Microsoft’s preferred, certified, efficient laptop experience. That is still valuable, but it is not the same thing.

The Real Answer Is That NPUs Must Now Earn Their Keep

Microsoft is not making NPUs useless. It is making them compete.
That is healthy for Windows. A platform as broad as Windows cannot afford to bind its AI future to one accelerator type, especially when many of its most capable existing machines already contain powerful GPUs. The Copilot+ launch needed a clean spec. The Windows ecosystem needs flexibility.
The NPU’s future depends on whether the software stack uses it often enough and well enough that users notice the difference. If the NPU enables always-available local features without fan noise, battery drain, or privacy compromise, it will not be useless silicon. It will be one of the reasons modern laptops feel modern. If it remains idle while the real AI action moves to GPUs and the cloud, the skeptics will have been right.
That outcome is now more in Microsoft’s hands than the hardware vendors’. AMD, Intel, Qualcomm, and Nvidia can ship the silicon, drivers, and TOPS numbers. Microsoft decides which Windows experiences run where, which APIs developers can trust, and which labels it asks consumers to believe.

The Shape of the Windows AI PC Is Finally Becoming Visible

The most concrete lesson from this experimental release is that Windows AI is moving away from a single hardware doorway and toward a tiered model where different accelerators unlock different experiences.

Windows 11’s local language model APIs are gaining experimental GPU support on non-Copilot+ PCs, beginning with Nvidia GeForce RTX 30-series and newer GPUs with at least 6GB of VRAM.
This first GPU path is not a mainstream consumer rollout, because it requires Developer Mode, an Insider Experimental Channel build, compatible drivers, and on-demand model installation.
Copilot+ PCs still matter because their NPUs provide Microsoft’s validated, efficient path for supported local AI features, especially on laptops.
GPU support weakens the idea that Windows local AI must be exclusive to NPU-equipped Copilot+ systems.
Developers benefit if Microsoft keeps expanding Windows AI APIs across hardware backends while preserving clear readiness checks and user consent for model downloads.
The NPU’s long-term value will depend less on TOPS marketing and more on whether Windows and third-party apps use it for practical, low-power work users can actually feel.

The NPU is not dead; the slogan is. Microsoft’s first serious GPU opening for Windows local language models is an admission that the AI PC will not be one kind of PC, one accelerator, or one badge on a retail box. The next phase will be won by the machines that run useful local AI predictably, privately, and efficiently — whether the silicon doing the work is called an NPU, a GPU, or something Windows users never have to think about at all.

References

Primary source: OC3D
Published: 2026-06-15T11:30:07.595921

Are NPUs useless? - Windows 11's Local AI is getting GPU support - OC3D

Microsoft is adding Nvidia GPU support to its Local AI tools, making dedicated NPU hardware practically useless.

overclock3d.net
Related coverage: techradar.com

Microsoft is bringing AI features to more Windows 11 PCs — just in case you were under the impression that AI was being cut back | TechRadar

There's no need for an NPU for certain AI features now, as an Nvidia GPU will do the job

www.techradar.com
Official source: microsoft.com

Shop High-Performance Laptops, Computers, PCs, and Tablets | Microsoft Windows

Shop high-performance laptops, PCs, and tablets built for multitasking, advanced AI capabilities, powerful graphics, and all-day performance. Explore premium, high-spec Windows devices.

www.microsoft.com
Official source: developer.microsoft.com

Windows AI | Microsoft Developer

A unified, reliable and secure platform supporting the AI developer lifecycle from model selection, fine-tuning, optimizing and deployment across CPU, GPU, NPU and cloud.

developer.microsoft.com
Related coverage: windowscentral.com

Microsoft Copilot+ PC guide: What it is, features, how to access it, and PC requirements, and everything you need to know | Windows Central

Microsoft Copilot+ has been announced for upcoming AI PCs, but what exactly is it? Here's everything you need to know.

www.windowscentral.com
Related coverage: tomshardware.com

Copilot+ PCs: All we know about the AI-ready laptops and exclusive Windows features | Tom's Hardware

Microsoft's shiny new AI innovations for the laptop space

www.tomshardware.com

Related coverage: skywork.ai

Copilot+ PC Requirements — The Ultimate Guide

Understand Microsoft Copilot+ PC AI requirements: 40+ TOPS NPU, RAM, storage, supported processors, and which features need Copilot+. Read the complete guide.

skywork.ai
Related coverage: pcgamer.com

Microsoft plans to launch a cheaper 8 GB Surface laptop later this year which won't meet the requirements of a Copilot+ PC | PC Gamer

How far we have fallen.

www.pcgamer.com
Official source: news.microsoft.com

PowerPoint Presentation

PDF document

news.microsoft.com
Official source: cdn-dynmedia-1.microsoft.com

!! SpecificProject Only -MSSurface_Logo_horizontal_C-CoPilot Gray_RGB

PDF document

cdn-dynmedia-1.microsoft.com
Related coverage: na.ingrammicro.com

Copilot PC Surface Pro for Business Intel Product FAQ

PDF document

na.ingrammicro.com
Related coverage: dandh.com

The Role of the AI PC in Your Next Fleet Refresh

PDF document

www.dandh.com

ChatGPT · Jun 16, 2026

Microsoft is testing Windows AI APIs that let supported Nvidia GeForce RTX 30-series and newer GPUs with at least 6GB of VRAM run local language-model workloads through Windows App SDK 2.2 Experimental 9, currently requiring Windows Insider builds, Developer Mode, and updated drivers. That is a small developer preview with much larger implications. Microsoft spent the Copilot+ launch cycle teaching users that the NPU was the passport to “real” Windows AI; now it is quietly acknowledging that many capable Windows PCs already have another AI accelerator installed. The result is not the end of Copilot+ PCs, but it is the beginning of a more complicated and more honest Windows AI story.

Microsoft’s NPU Wall Was Always Too Neat to Last

The Copilot+ PC pitch was tidy because hardware marketing likes tidy borders. A device either had a qualifying neural processing unit, or it did not. If it crossed Microsoft’s threshold, it could claim a new class of Windows experiences; if it did not, it remained a conventional Windows 11 PC, no matter how expensive, powerful, or recent it happened to be.
That clarity helped Microsoft, Qualcomm, AMD, and Intel explain a new category to buyers. It also let OEMs sell a refresh cycle at a moment when PC replacement demand needed a reason to exist beyond thinner bezels and better battery life. The 40 TOPS NPU became a shorthand for readiness, even if the real-world usefulness of many Copilot+ features remained uneven.
But Windows is not iOS, and the PC installed base does not obey clean product lines. Enthusiasts have desktops with RTX cards that dwarf mobile NPUs in raw compute. Workstations have discrete GPUs bought for rendering, development, simulation, and gaming. Laptops sold before the Copilot+ branding era may still have the silicon needed to do useful local inference.
That made the original boundary feel artificial from the start. NPUs are efficient and well-suited to background AI workloads, but they are not the only way to run a model locally. Microsoft’s experimental GPU support is a concession to the obvious: Windows AI cannot scale if it pretends the only useful accelerator is the one OEMs started highlighting in 2024.

The Experimental Switch That Changes the Argument

The new support arrives through Windows App SDK 2.2 Experimental 9, not through a polished consumer feature update. That distinction matters. This is not Microsoft flipping a switch that suddenly enables every Copilot+ feature on every gaming PC.
What is being tested is a developer-facing path for Windows AI language-model APIs to run on supported GPUs. Reports and Microsoft-facing developer material point first to Nvidia RTX 30-series and newer GPUs with at least 6GB of VRAM. The preview also sits behind the usual early-access gates: Windows Insider builds, Developer Mode, updated GPU drivers, and the experimental SDK itself.
That makes it easy to understate the move. Experimental SDK features often change, break, or disappear before reaching stable release. Developers who build against them are, by definition, volunteering to live close to the blast radius.
But platform direction is often visible first in the places normal users never look. Microsoft does not need to declare a new consumer policy for the implications to be clear. If the Windows AI stack can target CPU, GPU, NPU, and cloud execution paths, then the Copilot+ line becomes less a hard technical border and more a preferred configuration.

Windows AI Starts Looking Like DirectX, Not a Sticker Program

The smarter long-term model is not “AI feature equals NPU feature.” It is closer to how Windows has historically handled graphics, media, and acceleration. The platform exposes APIs, the runtime chooses suitable hardware, and developers target capabilities rather than product badges.
That is the more mature version of Microsoft’s Windows AI strategy. An app should not have to care whether a rewrite suggestion, object extraction model, or image enhancement pipeline runs on an NPU, a discrete GPU, an integrated GPU, a CPU fallback, or a cloud service. It should care about availability, latency, power draw, privacy posture, and quality.
This is where the Windows App SDK matters. Microsoft has been trying to give developers a modern layer for Windows desktop apps that is less tied to old app-model divisions. If AI APIs live there, Microsoft can make local inference feel like a platform primitive instead of a one-off demo welded to a specific laptop launch.
That is also why the RTX support is strategically important even if the first wave is narrow. Nvidia’s GPUs are already familiar to developers as compute devices. CUDA, TensorRT, ONNX Runtime integrations, and years of AI tooling mean the RTX base is not just large; it is technically legible to the people likely to experiment with Windows AI APIs first.

Copilot+ Loses Exclusivity, Not Relevance

It is tempting to frame GPU support as Microsoft undermining Copilot+ PCs. In one sense, it is. If a desktop with an RTX 3060 can run some local Windows AI workloads, then the idea that only a new NPU-equipped laptop can participate starts to look like marketing rather than architecture.
But exclusivity was never the strongest argument for Copilot+ hardware. Efficiency was. A discrete GPU can be fast, but it is not necessarily the best place to run small, persistent, background inference tasks on battery. Fire up an RTX laptop GPU for routine language-model work and the performance may be excellent, but the power profile will not resemble an NPU sipping energy while the rest of the system stays quiet.
That leaves Copilot+ PCs with a real role. NPUs remain attractive for always-on or frequent lightweight operations: semantic indexing, image analysis, live captions, translation, camera effects, and other tasks that users do not want to feel as a fan curve. The best argument for NPUs is not that GPUs cannot do AI; it is that GPUs should not have to do every AI task.
Microsoft’s challenge is that the original messaging blurred that distinction. Copilot+ made the NPU sound like the key to the kingdom. The GPU preview reframes it as one accelerator among several, with better battery economics but not exclusive moral authority.

Nvidia Gets the First Practical Win

The preview’s RTX-first shape is not surprising. Nvidia has spent years turning its gaming GPUs into a mass-market AI compute platform. The company’s consumer cards are not just graphics hardware; they are the most common local AI accelerator many Windows users own.
The 6GB VRAM floor is also telling. It excludes many older and lower-end cards, but it captures a meaningful slice of the RTX 30-series and newer market. It suggests Microsoft is aiming at practical local model execution rather than a theoretical “any GPU will do” compatibility story that collapses under memory pressure.
For Nvidia, the optics are excellent. The company has already been pushing the phrase “AI PC” beyond the NPU-centric definition preferred by Microsoft’s OEM partners. RTX owners have been running local models, image generators, upscalers, and AI-assisted creative tools for years. Windows-level support gives that existing behavior a more official platform channel.
For AMD and Intel, the preview raises an obvious question: when does broader GPU support arrive, and through which execution providers? Microsoft’s public Windows AI direction points toward multiple hardware paths, but early implementation choices shape developer assumptions. If the first usable GPU path is Nvidia, developers will naturally test against Nvidia first.
That is how Windows platform gravity works. The first path that works reliably becomes the de facto target, even if Microsoft later widens the matrix. AMD and Intel can still win important ground, particularly with integrated GPUs and NPUs in mainstream laptops, but they cannot afford to let Windows AI tooling become another CUDA-shaped ecosystem by inertia.

The Features Users Care About Are Still Unevenly Distributed

The feature list associated with Windows AI is broad enough to invite confusion. Text rewriting, language-model APIs, Photos Super Resolution, object extraction, erase tools, image description, semantic search, and other experiences do not all have the same hardware requirements or product status. Some are app features. Some are APIs. Some are tied to Copilot+ branding. Some are rolling through Insider channels. Some may use local models, cloud services, or both depending on the device and implementation.
That complexity is the tax Microsoft pays for turning AI into a platform layer while simultaneously selling it as a consumer feature bundle. A developer reads “Language Model APIs on GPU” and sees a new local inference target. A user reads “Windows AI on RTX GPUs” and may reasonably expect Recall, Click to Do, Photos tricks, and Copilot behavior to change overnight.
They should not. Experimental GPU support does not mean every Copilot+ feature is suddenly unlocked on every RTX system. It means Microsoft is testing a path that allows certain local AI workloads to run beyond the original NPU-only class of machines.
That distinction will matter for support desks. Users will not parse API families, execution providers, model availability, and Insider channel requirements. They will ask why their RTX card works for one AI feature but not another, or why a desktop runs a demo while a laptop with a different GPU does not.
Microsoft can solve some of that with better capability reporting. Windows already exposes GPU, CPU, memory, and driver details; AI readiness needs the same level of clarity. If “AI features available” becomes another vague Settings banner, administrators and users will spend the next year reverse-engineering what their PCs can actually do.

Local AI Is Also a Privacy Argument

The GPU move is not only about speed or inclusivity. It also intersects with the most sensitive part of Microsoft’s AI push: trust. The more Windows can do locally, the less Microsoft has to ask users to accept that their files, images, prompts, and context are being shipped elsewhere for processing.
That matters because Microsoft’s AI rollout has been shadowed by privacy anxiety. Recall became the obvious example, but the concern is broader. Users are not merely worried that AI features will be useless; they are worried that useful AI features will be too hungry for personal context.
Local execution does not automatically make a feature safe. A poorly designed local index can still expose sensitive data. A badly permissioned model cache can still create enterprise headaches. A local AI tool can still produce incorrect output, leak information across user boundaries, or widen the attack surface.
But locality gives Microsoft a stronger starting position. Running a rewrite model, object detector, or image enhancement pipeline on hardware inside the user’s PC is easier to defend than sending the same job to a remote service. For enterprises, local processing can also simplify policy conversations, even if it does not eliminate them.
This is where GPUs help. If Microsoft can use the installed RTX base to keep more AI work on-device, it can reduce cloud dependency without forcing every interested user to buy a new Copilot+ PC. That is both a technical and political advantage.

The Cloud Fallback Remains the Quiet Escape Hatch

Microsoft is not abandoning the cloud. It cannot. Some models will be too large, some tasks too expensive locally, and some experiences too tied to Microsoft’s online services to run entirely on the PC. Copilot itself remains deeply cloud-shaped, even as Windows gains more local AI plumbing.
The emerging architecture is hybrid by necessity. Local hardware handles work that benefits from low latency, privacy, offline access, or low marginal cost. Cloud services handle larger models, fresh information, cross-device state, and features Microsoft wants to update continuously.
That hybrid model is sensible, but it requires honesty. Users should know when a feature runs locally and when it leaves the machine. Administrators should be able to enforce that boundary. Developers should have APIs that expose capability and policy clearly enough that “local-first” does not become a marketing phrase with hidden exceptions.
GPU support makes the hybrid story more credible because it increases the number of machines that can do meaningful work locally. It also makes the boundary more variable. Two Windows 11 PCs may both be “AI-capable,” but one may have an NPU, one may have an RTX card, one may have both, and one may rely primarily on cloud fallback.
That variability is normal for Windows. It is also exactly why Microsoft needs to treat AI capability like a managed platform concern, not a seasonal branding campaign.

Developers Get a Bigger Playground and a Bigger Test Matrix

For developers, the preview is good news wrapped in familiar Windows complexity. The good news is obvious: the addressable market for local Windows AI experiences grows if RTX GPUs can participate. An app that once targeted only Copilot+ PCs can begin imagining a broader set of users.
The complexity is equally obvious. Developers now have to think about multiple execution paths, hardware classes, driver versions, VRAM limits, model availability, and performance characteristics. A feature that feels instant on one machine may be slow, unavailable, or power-hungry on another.
That is not new in Windows development. Games have lived with capability tiers for decades. Creative apps already scale across CPU and GPU configurations. The difference is that AI features often feel binary to users: either the button exists and works, or it does not.
Microsoft’s API design will therefore matter more than its keynote language. Developers need clean capability checks, predictable fallbacks, good error states, and documentation that does not require spelunking through Insider release notes. If those pieces are weak, GPU support will produce demos, not dependable apps.
The upside is substantial. Local text transformation, summarization, semantic search, image cleanup, and media enhancement are exactly the kinds of features that third-party Windows apps can use without becoming “AI apps” in the hype-cycle sense. The less developers have to package models and runtimes themselves, the more likely Windows AI becomes a normal part of application design.

Sysadmins Should Read “Experimental” in Red Ink

Enterprise IT should not treat this preview as a deployment signal. Experimental SDK support, Insider builds, and Developer Mode are not ingredients for managed production fleets. The right posture today is observation, not enablement.
Still, administrators should pay attention because this is the shape of future support tickets. Users with powerful GPUs will expect AI features to work. Developers inside organizations will begin testing local model APIs. Security teams will need policies for where models live, what data they process, and whether outputs are logged, indexed, or retained.
Driver management will also become more important. If Windows AI features depend on GPU execution providers and vendor runtimes, then display drivers become part of the AI reliability chain. That is not a comforting thought for anyone who has had to stabilize creative workstations, CAD machines, or gaming-class laptops in a corporate environment.
There is also procurement fallout. For the past two years, buyers have been told to look for NPUs when planning AI-ready Windows fleets. Now the answer becomes more nuanced. An NPU may be preferred for mobile productivity systems, while a GPU may be more relevant for workstations, developer machines, and creator PCs.
That nuance is healthier than the sticker version. It is also harder to express in a purchasing spreadsheet.

The Inclusivity Argument Is Real, but Not Universal

The most user-friendly reading of the preview is that Microsoft is opening Windows AI to PCs people already own. That is broadly true. A desktop with an RTX 3060 or better may now be closer to the Windows AI future than Microsoft’s original Copilot+ boundary suggested.
That matters for enthusiasts who upgrade GPUs more often than entire PCs. It matters for gamers whose hardware has been treated as irrelevant to the official AI PC story despite being extremely capable. It matters for creators and developers who bought RTX systems for other reasons and now get another use for that silicon.
But inclusivity has limits. RTX 20-series owners are apparently outside the first wave. Systems with less than 6GB of VRAM are out. AMD and Intel discrete GPU users are not the headline beneficiaries. Many mainstream laptops without strong NPUs or GPUs will still rely on CPU paths or the cloud.
There is also a battery-life divide. A desktop RTX card running local AI is a very different proposition from a thin-and-light laptop trying to preserve all-day mobility. Microsoft can widen access without making all hardware equally suited to all AI tasks.
The better conclusion is not “NPUs are useless.” It is that Windows AI is becoming heterogeneous. The PC ecosystem already is; Microsoft is finally letting its AI platform admit it.

Recall Still Haunts the Room

Any discussion of local Windows AI eventually runs into Recall, whether Microsoft wants it there or not. Recall became the symbol of the Copilot+ era because it combined technical ambition with an unusually intimate data model. It also forced Microsoft to relearn that users and administrators judge AI features less by demo magic than by failure modes.
GPU support does not answer the Recall debate. It does, however, complicate the assumptions around feature eligibility. If certain language and vision models can run on RTX hardware, users will ask why some flagship AI experiences remain tied to NPUs or Copilot+ branding.
There may be good answers. Some features need specific security architecture. Some need power-efficient background processing. Some need a constrained hardware baseline to provide supportable performance. Some are simply not ready to be widened.
Microsoft should make those distinctions explicit. If a feature is NPU-only because of security, say so. If it is NPU-only because of battery life, say so. If it is NPU-only because Microsoft is preserving a product tier, users will infer that too.
The RTX preview gives Microsoft an opportunity to reset the conversation around capability rather than exclusivity. It should take it.

The RTX Detour Points to the Real Windows AI Roadmap

The near-term story is a developer preview. The long-term story is a Windows runtime that brokers AI work across whatever compute is available. That is the only strategy that makes sense for a platform as messy and durable as the PC.
In that future, the NPU is not demoted; it is specialized. The GPU is not a hack; it is a high-throughput option. The CPU is not irrelevant; it remains the universal fallback. The cloud is not banished; it handles what local silicon cannot or should not.
Microsoft has been moving toward this with Windows AI APIs, Windows ML, Foundry Local, ONNX Runtime integrations, and hardware-specific execution providers. The RTX preview is one more piece, but it is a piece users can understand because the installed base is visible. People know whether their PC has an RTX card.
That visibility makes the shift politically potent. Microsoft no longer gets to say, implicitly or explicitly, that useful local AI begins only with a newly purchased Copilot+ machine. It now has to explain which workloads run where, and why.
That is good for the platform. Windows has always been strongest when it abstracts hardware diversity without pretending diversity does not exist. AI should be no different.

What RTX Owners Can Actually Take From This Preview

This preview is best read as a direction-of-travel signal, not a consumer rollout. The practical consequences are narrower than the headlines, but they are concrete enough to matter.

Microsoft is testing GPU execution for Windows AI language-model APIs through Windows App SDK 2.2 Experimental 9, rather than broadly unlocking every Copilot+ feature for RTX PCs.
The first reported supported GPU class is Nvidia GeForce RTX 30-series and newer hardware with at least 6GB of VRAM, alongside Windows Insider builds, Developer Mode, updated drivers, and the experimental SDK.
Copilot+ PCs still have a strong efficiency argument because NPUs are better suited to low-power, frequent, and background AI tasks than discrete GPUs.
RTX support expands the potential audience for local Windows AI apps, especially desktops, creator systems, gaming PCs, and developer workstations that already have capable Nvidia hardware.
IT departments should treat the feature as experimental and begin planning policy, driver, and support models rather than deploying it broadly.
The larger shift is from a badge-based AI PC story to a capability-based Windows AI platform that can use NPUs, GPUs, CPUs, and cloud services as conditions require.

Microsoft’s RTX experiment is a quiet admission that the Windows AI future cannot be fenced inside one hardware definition. The company still needs Copilot+ PCs, and NPUs still solve real problems, but the broader Windows ecosystem was never going to wait politely for a refresh cycle before doing local AI. If Microsoft can turn this preview into a stable, transparent, policy-aware platform, Windows AI may finally become less about which sticker is on the palm rest and more about what the machine in front of you can actually do.

References

Primary source: Technobaboy
Published: 2026-06-16T05:00:11.510528

Microsoft tests Windows AI tools on RTX GPUs - Technobaboy

Microsoft is testing Windows AI features on RTX GPUs, expanding support beyond NPUs for more PCs. Details here.

www.technobaboy.com
Related coverage: tomshardware.com

Microsoft is reportedly testing Copilot+ AI features with discrete GPUs instead of NPUs — a feature available on Windows App SDK with a Windows Insider Experimental Channel build and Developer Mode turned on | Tom's Hardware

Is this the beginning of the end for Copilot+ PCs?

www.tomshardware.com
Related coverage: techradar.com

Microsoft is bringing AI features to more Windows 11 PCs — just in case you were under the impression that AI was being cut back | TechRadar

There's no need for an NPU for certain AI features now, as an Nvidia GPU will do the job

www.techradar.com
Official source: learn.microsoft.com

Windows App SDK 2.0 release notes - Windows apps | Microsoft Learn

Provides information about what's new in Windows App SDK 2.0.

learn.microsoft.com
Related coverage: techspot.com

Microsoft is now letting Nvidia GPUs run local AI features that were locked to Copilot+ PCs | TechSpot

When Copilot+ PCs launched on June 18, 2024, the messaging was clear: dedicated AI hardware was essential. These machines were defined in part by their neural processing...

www.techspot.com
Official source: developer.microsoft.com

Windows SDK overview - Windows apps | Microsoft Learn

Learn about the Windows SDK, benefits it provides to developers, what is ready for developers now, and how to give feedback.

developer.microsoft.com

Related coverage: pcworld.com

Microsoft tests Windows AI features on RTX GPUs, not just NPUs | PCWorld

An experimental version of Microsoft's Windows App SDK, the foundation of many Windows AI capabilities, is being made available to PCs with GPU, not just NPUs. It's a signal that times are changing.

www.pcworld.com
Related coverage: overclock3d.net

Are NPUs useless? - Windows 11's Local AI is getting GPU support - OC3D

Microsoft is adding Nvidia GPU support to its Local AI tools, making dedicated NPU hardware practically useless.

overclock3d.net
Official source: support.microsoft.com

KB5096139: Nvidia TensorRT-RTX Execution Provider update (version 2.2605.1.0) - Microsoft Support

support.microsoft.com
Official source: marketingassets.microsoft.com

MS-Azure_logo_horiz_c-white_rgb

A football player gets ready to hiking a football.

marketingassets.microsoft.com
Official source: github.com

https://github.com/microsoft/windowsappsdk/releases
Related coverage: winbuzzer.com

Microsoft Tests Phi Silica for Windows AI on Nvidia GPUs

Microsoft is testing Phi Silica local AI models on Nvidia RTX GPUs for Windows PCs, widening options while keeping support experimental and developer-gated.

winbuzzer.com
Related coverage: rocm.docs.amd.com

https://rocm.docs.amd.com/_/downloads/install-on-windows/en/develop/pdf

Navigation section

Windows 11 Local AI APIs Expand to NVIDIA RTX—Copilot+ Badge Gets Cracked

The First Crack Is an API, Not Recall​

The NPU Still Has a Job, Just Not the Job Microsoft Sold First​

NVIDIA Gets Pulled Back Into the Windows AI Center of Gravity​

Developers Care Less About Badges Than Addressable Hardware​

Privacy Becomes More Credible When Local AI Stops Being Rare​

Recall Remains the Feature Microsoft Cannot Casually Unfence​

The Copilot+ Badge Starts Looking More Like Centrino Than Windows Itself​

Enterprise IT Will Read This as a Support Matrix Problem​

The Real Risk Is Another Half-Platform​

The RTX Door Rewrites the Copilot+ Fine Print​

References​

AI

Microsoft’s AI PC Wall Was Always Built on Efficiency, Not Capability​

The RTX Exception Turns a Badge Into a Negotiation​

Phi Silica Becomes a Windows Component, Not Just a Demo Model​

Developers Care Less About the Badge Than the Call​

The NPU Was Not a Lie, but the Story Was Too Small​

Enterprise IT Will See Promise Wrapped in Policy Risk​

Nvidia Gets the Installed Base Microsoft Needs​

The Consumer Message Gets Messier but More Truthful​

The Real Battle Is Over the Default AI Runtime​

The Copilot+ Line Is Thinner Than Microsoft First Drew It​

The New Rules Windows Users Should Actually Remember​

References​

AI

Microsoft’s Copilot+ Wall Was Always More Marketing Than Physics​

The First Crack Appears in the Developer Layer​

Phi Silica Becomes Less of a Copilot+ Ornament​

The NPU Still Has the Better Laptop Argument​

Copilot+ Exclusivity Meets the Installed Base​

Recall Remains the Line Microsoft Is Not Ready to Cross​

Nvidia Gets the Validation It Has Been Arguing For​

Developers Now Have a More Interesting Windows AI Pitch​

Local AI Is Becoming a Privacy Feature, Not Just a Performance Feature​

The Copilot+ Brand Looks Less Like a Destination and More Like a Tier​

The RTX Door Opens, but Only a Few Rooms Are Unlocked​

References​

AI

Microsoft’s NPU Wall Now Has a GPU-Sized Door in It​

Phi Silica Becomes the Test Case for a More Flexible Windows AI Stack​

The Copilot+ Badge Loses Some Exclusivity, Not Its Purpose​

Developers Get a Bigger Addressable Market, but Also a Bigger Testing Problem​

Nvidia Wins the First Round Because Windows AI Needs Real Silicon Today​

Recall Remains the Line Microsoft Is Not Crossing​

Local AI Is Becoming a Windows Distribution Problem​

The Privacy Pitch Is Real, but It Is Not Self-Executing​

The RTX Door Opens, but the House Is Still Under Construction​

References​

AI

Microsoft Just Broke Its Own Copilot+ Shortcut​

The First Crack Is Small, but It Is Real​

NPUs Were Sold as the Future, Not Just Another Backend​

The GPU Is the Wrong Tool Until It Is the Only One Users Already Have​

Copilot+ Was Always a Certification, Not a Law of Physics​

Recall Still Casts a Long Shadow​

The Silicon Economics Are More Subtle Than “Useless”​

Nvidia Gets the First Seat Because the Software Stack Is the Product​

Developers Needed Reach More Than Purity​

Enterprise IT Will See Both Opportunity and Another Policy Problem​

The Copilot Brand Has a Naming Problem, Not a Death Sentence​

This Is the Beginning of a Backend War Inside Windows​

The RTX Door Opens, but It Does Not Open the Whole House​

The Real Answer Is That NPUs Must Now Earn Their Keep​

The Shape of the Windows AI PC Is Finally Becoming Visible​

References​

AI

Microsoft’s NPU Wall Was Always Too Neat to Last​

The Experimental Switch That Changes the Argument​

Windows AI Starts Looking Like DirectX, Not a Sticker Program​

Copilot+ Loses Exclusivity, Not Relevance​

Nvidia Gets the First Practical Win​

The Features Users Care About Are Still Unevenly Distributed​

Local AI Is Also a Privacy Argument​

The Cloud Fallback Remains the Quiet Escape Hatch​

Developers Get a Bigger Playground and a Bigger Test Matrix​

The First Crack Is an API, Not Recall

The NPU Still Has a Job, Just Not the Job Microsoft Sold First

NVIDIA Gets Pulled Back Into the Windows AI Center of Gravity

Developers Care Less About Badges Than Addressable Hardware

Privacy Becomes More Credible When Local AI Stops Being Rare

Recall Remains the Feature Microsoft Cannot Casually Unfence

The Copilot+ Badge Starts Looking More Like Centrino Than Windows Itself

Enterprise IT Will Read This as a Support Matrix Problem

The Real Risk Is Another Half-Platform

The RTX Door Rewrites the Copilot+ Fine Print

References

Microsoft’s AI PC Wall Was Always Built on Efficiency, Not Capability

The RTX Exception Turns a Badge Into a Negotiation

Phi Silica Becomes a Windows Component, Not Just a Demo Model

Developers Care Less About the Badge Than the Call

The NPU Was Not a Lie, but the Story Was Too Small

Enterprise IT Will See Promise Wrapped in Policy Risk

Nvidia Gets the Installed Base Microsoft Needs

The Consumer Message Gets Messier but More Truthful

The Real Battle Is Over the Default AI Runtime

The Copilot+ Line Is Thinner Than Microsoft First Drew It

The New Rules Windows Users Should Actually Remember

References

Microsoft’s Copilot+ Wall Was Always More Marketing Than Physics

The First Crack Appears in the Developer Layer

Phi Silica Becomes Less of a Copilot+ Ornament

The NPU Still Has the Better Laptop Argument

Copilot+ Exclusivity Meets the Installed Base

Recall Remains the Line Microsoft Is Not Ready to Cross

Nvidia Gets the Validation It Has Been Arguing For

Developers Now Have a More Interesting Windows AI Pitch

Local AI Is Becoming a Privacy Feature, Not Just a Performance Feature

The Copilot+ Brand Looks Less Like a Destination and More Like a Tier

The RTX Door Opens, but Only a Few Rooms Are Unlocked

References

Microsoft’s NPU Wall Now Has a GPU-Sized Door in It

Phi Silica Becomes the Test Case for a More Flexible Windows AI Stack

The Copilot+ Badge Loses Some Exclusivity, Not Its Purpose

Developers Get a Bigger Addressable Market, but Also a Bigger Testing Problem

Nvidia Wins the First Round Because Windows AI Needs Real Silicon Today

Recall Remains the Line Microsoft Is Not Crossing

Local AI Is Becoming a Windows Distribution Problem

The Privacy Pitch Is Real, but It Is Not Self-Executing

The RTX Door Opens, but the House Is Still Under Construction

References

Microsoft Just Broke Its Own Copilot+ Shortcut

The First Crack Is Small, but It Is Real

NPUs Were Sold as the Future, Not Just Another Backend

The GPU Is the Wrong Tool Until It Is the Only One Users Already Have

Copilot+ Was Always a Certification, Not a Law of Physics

Recall Still Casts a Long Shadow

The Silicon Economics Are More Subtle Than “Useless”

Nvidia Gets the First Seat Because the Software Stack Is the Product

Developers Needed Reach More Than Purity

Enterprise IT Will See Both Opportunity and Another Policy Problem

The Copilot Brand Has a Naming Problem, Not a Death Sentence

This Is the Beginning of a Backend War Inside Windows

The RTX Door Opens, but It Does Not Open the Whole House

The Real Answer Is That NPUs Must Now Earn Their Keep

The Shape of the Windows AI PC Is Finally Becoming Visible

References

Microsoft’s NPU Wall Was Always Too Neat to Last

The Experimental Switch That Changes the Argument

Windows AI Starts Looking Like DirectX, Not a Sticker Program

Copilot+ Loses Exclusivity, Not Relevance

Nvidia Gets the First Practical Win

The Features Users Care About Are Still Unevenly Distributed

Local AI Is Also a Privacy Argument

The Cloud Fallback Remains the Quiet Escape Hatch

Developers Get a Bigger Playground and a Bigger Test Matrix