text-to-speech

  1. ChatGPT

    Microsoft's MAI: In-House MAI-Voice-1 and MAI-1-Preview Reshape Copilot and Azure

    Microsoft has quietly crossed a strategic Rubicon: after years of tight integration with OpenAI, the company has begun shipping its own first-party foundation models — notably MAI-Voice-1 and MAI-1-preview — and is positioning them inside Copilot and Azure as the start of a long-term bid to...
  2. ChatGPT

    MAI-Voice-1: Expressive Audio in Copilot Labs Audio Expressions

    Microsoft’s latest Copilot experiment turns text into talk — and, in early tests, it sounds more like a collaborator than a canned text‑to‑speech bot. The company has quietly introduced MAI‑Voice‑1, a high‑throughput speech generation model surfaced in a new Copilot Labs experience called Audio...
  3. ChatGPT

    Microsoft unveils in-house AI models MAI-Voice-1 and MAI-1-preview

    Microsoft’s AI group quietly cut the ribbon on two home‑grown foundation models on August 28, releasing a high‑speed speech engine and a consumer‑focused text model that together signal a strategic shift: Microsoft intends to build its own AI muscle even as its long, lucrative relationship with...
  4. ChatGPT

    Windows Ambience: Multimodal, Agentic AI with Copilot+ for Enterprise

    Microsoft’s Windows lead has just sketched a future in which the operating system becomes ambient, multimodal and agentic — able to listen, see, and act — a shift powered by a new class of on‑device AI and tight hardware integration that will reshape how organisations manage and secure Windows...
  5. ChatGPT

    VibeVoice: Open-Source Hour-Scale Multi-Speaker TTS for Research

    Microsoft’s new VibeVoice marks a striking shift in what open-source text-to-speech can do: from short, single-voice clips to hour‑scale, multi‑speaker spoken audio that resembles a produced podcast — and it’s available now for researchers and tinkerers to try. The framework packages a compact...
  6. ChatGPT

    VibeVoice-1.5B: Open-Source Long-Form Multi-Speaker TTS for Research

    Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...
  7. ChatGPT

    Unlock Accessibility: How Windows Magnifier Reading Enhances Screen Accessibility

    Magnifier is an essential accessibility feature built into Windows that helps users with low vision to better interact with their screens. One often underutilized but incredibly powerful capability of Magnifier is its ability to read text aloud, converting visible on-screen information into an...
  8. ChatGPT

    Enhance Your Reading with Microsoft Edge's Immersive Reader: Features & How-To Guide

    Microsoft Edge's Immersive Reader is a powerful tool designed to enhance the online reading experience by simplifying webpage layouts, removing distractions, and offering customizable features to suit individual preferences. Originally developed to assist readers with dyslexia and dysgraphia...
  9. ChatGPT

    Microsoft Edge Immersive Reader: Your Guide to Accessibility & Focused Reading

    Reading online content can be a daunting task in today’s digital landscape. Webpages are often cluttered with advertisements, popups, and design elements that distract readers from the main text. In response, browser developers continually strive to incorporate tools that facilitate focus...
  10. ChatGPT

    The Future of Clear, Noisy-Resistant Synthetic Speech: How Machines Talk Like Humans

    It’s a time-honored ritual: you click play on your favorite digital assistant, and out comes the brisk, sometimes eerie, yet strikingly articulate voice—one that’s come a long way from the robotic monotones of the 1980s. But just how well do we truly understand these synthesized voices...
  11. ChatGPT

    Microsoft Teams’ Real-Time Multilingual Interpreter: Revolutionizing Global Collaboration

    If you had wandered into the corridors of Microsoft Digital just a few years ago, you might have heard the telltale echoes of well-meaning multilingual confusion: a French phrase spliced with English, a Japanese idiom offered in tentative tones, and perhaps a heartfelt “Can you repeat that?”...
  12. ChatGPT

    Discover the Next Generation of AI with Microsoft's o3 and o4-mini Models on Azure

    The advent of the o3 and o4-mini models on the Microsoft Azure OpenAI Service marks a thrilling leap into the next generation of AI reasoning. These latest entries in the o-series, unveiled within Azure AI Foundry and GitHub, don't merely build upon past versions—they shatter previous benchmarks...
  13. ChatGPT

    Microsoft Unveils GPT-4o Mini Audio Models for Azure AI

    Microsoft is once again pushing the envelope in AI innovation with the release of its new GPT-4o mini audio models, now available in preview on Azure AI Services. Targeted at developers and enterprises alike, these new models promise to deliver efficient speech-to-text and text-to-speech...
  14. ChatGPT

    Microsoft Copilot Introduces Read-Aloud Feature: A Game-Changer for Accessibility

    Ever dreamed of a world where your AI assistant not only writes brilliant responses but also narrates them like a tech-savvy audiobook? Wait no more—the future is here, and Microsoft Copilot is leading the charge with its latest enhancement: read-aloud support for chat responses. Set to launch...
  15. News

    Introducing the Speech Synthesis API in Microsoft Edge

    Starting with the Windows 10 Anniversary Update, Microsoft Edge will support the Speech Synthesis APIs defined in the W3C Web Speech API Specification. These APIs allow websites to convert text to audible speech with customizable voice and language settings. With them, website developers can add...
  16. News

    Using speech in your UWP apps: It’s good to talk

    As developers, we adapt as technologies move from the realm of Science Fiction into readily available SDKs. That’s certainly, or perhaps especially, true for speech technologies. In the past 5 years, devices have become more personal and demanding of new forms of interaction. In Windows 10...
  17. L

    Windows application that highlights and reads words aloud on pdf itself?

    Requirements: read aloud words from a PDF file; show the PDF file and highlight the words as it reads, on the PDF itself; free; works on Windows 8.1. What I have tried: I've tried Adobe Reader Read Out Loud. I heard nothing, and as far as I can tell, the program does not highlight words as it...
  18. News

    Use inking and speech to support natural input (10 by 10)

    With Windows 10, it’s now easier than ever to support natural input in your apps, and today we’d like to highlight using inking and speech to interact more naturally with your users. Digital inking with DirectInk: Despite the introduction and evolution of all types of computer input devices...
  19. News

    "Cortana, show me how I can add you to my app's..."

    Taking a breather from Visual Studio Extensions, today our Mobile Monday project shows you how you can build Cortana enabled projects... You’ve likely already read about how Cortana will use the power of Bing to deliver personalized, natural experiences to users. What you may...
  20. K

    Windows 7 How to Get Croatian Text-to-Speech Voice for Microsoft Narrator

    Where can I get a Croatian text-to-speech voice so that Microsoft Narrator can speak in Croatian?