speech synthesis

  1. Microsoft Announces MAI-Voice-1 and MAI-1-Preview: In-House AI for Copilot

    Microsoft has quietly shipped its first fully in‑house AI models — MAI‑Voice‑1 and MAI‑1‑preview — marking a deliberate shift in strategy that reduces dependence on OpenAI’s stack and accelerates Microsoft’s plan to own more of the compute, models, and product surface area that power Copilot...
  2. Microsoft unveils MAI-Voice-1 and MAI-1-Preview: Product-driven in-house AI strategy

    Microsoft’s AI unit has publicly launched two in‑house models — MAI‑Voice‑1 and MAI‑1‑preview — signaling a deliberate shift from purely integrating third‑party frontier models toward building product‑focused models Microsoft can own, tune, and route inside Copilot and Azure. Background...
  3. OpenAI–Microsoft Restructuring Delayed Over API, IP and AGI Clause Talks

    OpenAI’s highly anticipated corporate restructuring has been pushed off the immediate calendar as last‑ditch negotiations with Microsoft over API access, intellectual property (IP) rights and a disputed “AGI clause” remain unresolved, forcing a delay that could push the overhaul into next year...
  4. Windows Ambience: Multimodal, Agentic AI with Copilot+ for Enterprise

    Microsoft’s Windows lead has just sketched a future in which the operating system becomes ambient, multimodal and agentic — able to listen, see, and act — a shift powered by a new class of on‑device AI and tight hardware integration that will reshape how organisations manage and secure Windows...
  5. VibeVoice: Open-Source Hour-Scale Multi-Speaker TTS for Research

    Microsoft’s new VibeVoice marks a striking shift in what open-source text-to-speech can do: from short, single-voice clips to hour‑scale, multi‑speaker spoken audio that resembles a produced podcast — and it’s available now for researchers and tinkerers to try. The framework packages a compact...
  6. Fujitsu's AI Auto Presentation: Transforming Workplace Presentations with AI Avatars

    Public speaking, or glossophobia, affects approximately 75% of individuals to some degree, making it one of the most prevalent phobias worldwide. In professional settings, this fear can be particularly debilitating, with some employees going to great lengths to avoid presentations, including...
  7. Microsoft Copilot's AI Podcasting Revolution: Voice-Driven Content for Everyone

    It sounds like science fiction: you type in nearly anything—a dense academic article, a vacation idea, the most recent mind-melting tech conference recap—and, within seconds, you’re greeted not with an essay, but with an upbeat, back-and-forth podcast, staged by two impossibly game virtual...
  8. The Future of Clear, Noisy-Resistant Synthetic Speech: How Machines Talk Like Humans

    It’s a time-honored ritual: you click play on your favorite digital assistant, and out comes the brisk, sometimes eerie, yet strikingly articulate voice—one that’s come a long way from the robotic monotones of the 1980s. But just how well do we truly understand these synthesized voices...
  9. Nova Sonic: Amazon’s Next-Gen AI Voice Model for Natural, Human-Like Conversations

    It starts with a spark — or perhaps, in this case, a sonic boom. Imagine asking your virtual assistant to book a dinner reservation, troubleshoot your Wi-Fi, or walk your grandmother through installing a security update… and instead of the stilted, uncanny valley exchanges we’ve come to expect...
  10. A

    Text to speech softwares

    Hi, If you use any text to speech software's that are free, I would love to hear about them, thanks in advance.
  11. text to speech converter

    Hi. I need software for voice changer used for wave file (not online) similar to convert text to speech which exist (built) in windows 10. windows one is very basic with only one option. Please help. Thanks and best regards.
  12. Getting personal – speech and inking (App Dev on Xbox series)

    The way users interact with apps on different devices has gotten much more personal lately, thanks to a variety of new Natural User Interface features in the Universal Windows Platform. These UWP patterns and APIs are available for developers to easily bring in capabilities for their apps that...
  13. FamilyNotes: (Spoken) words and pictures

    FamilyNotes is a Windows 10 Universal Windows Platform (UWP) app that implements a group noticeboard. The goal of this app was to showcase the various Windows 10 input and interaction features that enable a personal and individualized computing experience. This is the third of three blog posts...
  14. Introducing the Speech Synthesis API in Microsoft Edge

    Starting with the Windows 10 Anniversary Update, Microsoft Edge will support the Speech Synthesis APIs defined in the W3C Web Speech API Specification. These APIs allow websites to convert text to audible speech with customizable voice and language settings. With them, website developers can add...
  15. Using speech in your UWP apps: From talking to conversing

    In the previous article, we introduced the idea of recognizing speech inside of a Windows 10 Universal Windows Platform (UWP) app and took a look at the SpeechRecognizer class and some of what it can do to enable speech recognition in our apps. In this article, we’re going to dig further into...
  16. Using speech in your UWP apps: It’s good to talk

    As developers, we adapt as technologies move from the realm of Science Fiction into readily available SDKs. That’s certainly, or perhaps especially, true for speech technologies. In the past 5 years, devices have become more personal and demanding of new forms of interaction. In Windows 10...
  17. App Development with Cortana | Visual Studio Toolbox

    In this episode, Robert is joined by Link Removed, who shows us how to integrate Cortana into apps. Among the topics Nick covers and shows are voice commands, speech recognition and synthesis, background voice commands and continuous dictation. Resources: Link Removed Nick's Demos Link...
  18. Assistive Context-Aware Toolkit (The open source app used by Professor Stephen Hawking)

    As soon as I read Mansib Rahman's post yesterday (as I write this) I knew I found the perfect project to highlight. I mean, come on it's Professor Stephen Hawking, Intel, .NET, WinForms (got to show some WinForm love now and then), open source and it's just cool! Link Removed I’m typing this...
  19. "Cortana, show me how I can add you to my app's..."

    Taking a breather from Visual Studio Extensions, today our Mobile Monday project shows you how you can build Cortana enabled projects... Link Removed ... You’ve likely already read about how Cortana will use the power of Bing to deliver personalized, natural experiences to users. What you may...
  20. VIDEO Vocoder

    :razz: :cool: :up: