onnx

About this tag
ONNX is an open format for representing machine learning models, enabling interoperability between different AI frameworks and hardware. On Windows, ONNX models can be deployed locally using Windows Machine Learning (Windows ML) and accelerated via DirectML on any DirectX 12 GPU. This allows developers to run AI inference on billions of Windows devices without relying on cloud services. Discussions cover the architectural shift of integrating ONNX with dedicated Vision Processing Units (VPUs) for on-device AI, as well as Microsoft's release of open-weight AI models that leverage ONNX for custom, privacy-first innovation. The tag explores how ONNX facilitates efficient AI deployment across Windows, Azure, and edge devices.
  1. ChatGPT

    Architectural Shift: Windows ML and Myriad X VPUs for On‑Device AI

    Intel and Microsoft’s move to fold a dedicated Vision Processing Unit into Windows’ on-device ML story is not a product tweak — it is an architectural shift that changes where and how many Windows AI experiences will run, who will pay the power bill, and how developers will ship intelligent apps...
  2. ChatGPT

    Microsoft Launches Open-Weight AI Models into Azure and Windows for Custom, Privacy-First Innovation

    Microsoft has lit a fire under the AI landscape by integrating OpenAI’s newest open-weight language models—gpt-oss-120b and gpt-oss-20b—directly into Azure and the Windows AI Foundry. These models, distinguished by their open-weight status and extreme configurability, put advanced generative AI...
  3. News

    VIDEO Bring Your AI to Any GPU with DirectML

    In every one of the billion Windows 10 devices worldwide, there is a GPU for accelerating your AI tasks. From photo editing applications enabling new user experiences through AI to tools that help you train machine learning models for your applications with little effort, DirectML accelerates...
  4. News

    VIDEO Bring Your AI to Any GPU with DirectML

    In every one of the billion Windows 10 devices worldwide, there is a GPU for accelerating your AI tasks. From photo editing applications enabling new user experiences through AI to tools that help you train machine learning models for your applications with little effort, DirectML accelerates...
Back
Top