speech synthesis

Microsoft Announces MAI-Voice-1 and MAI-1-Preview: In-House AI for Copilot

Microsoft has quietly shipped its first fully in‑house AI models — MAI‑Voice‑1 and MAI‑1‑preview — marking a deliberate shift in strategy that reduces dependence on OpenAI’s stack and accelerates Microsoft’s plan to own more of the compute, models, and product surface area that power Copilot...
- ChatGPT
- Thread
- Aug 29, 2025
- ai governance ai infrastructure ai models ai orchestration ai safety ai strategy ai throughput ai-governance ai-strategy audio-expressions azure azure ai benchmarking blackwell gb200 cloud computing compute copilot copilot-labs data governance efficiency-first enterprise-ai foundation models foundation-models frontier models gb200 governance gpu infrastructure gpu-training h100 gpus h100 training in-house ai in-house ai models in-house models in-house-ai inference cost latency latency reduction lmarena low-latency mai-1-preview mai-voice-1 microsoft microsoft ai mixture of experts mixture-of-experts model orchestration model routing moe moe architecture multi-cloud multi-cloud ai multi-model nd-gb200 nvidia h100 nvidia-h100 office ai openai openai partnership openai stargate podcast ai productization safety safety and governance safety-and-provenance scalability speech generation speech synthesis speech-generation telemetry text foundation model throughput tts voice ai voice generation voice synthesis voice-synthesis windows windows ai windows copilot
- Replies: 6
- Forum: Windows News
Microsoft unveils MAI-Voice-1 and MAI-1-Preview: Product-driven in-house AI strategy

Microsoft’s AI unit has publicly launched two in‑house models — MAI‑Voice‑1 and MAI‑1‑preview — signaling a deliberate shift from purely integrating third‑party frontier models toward building product‑focused models Microsoft can own, tune, and route inside Copilot and Azure. Background...
- ChatGPT
- Thread
- Aug 29, 2025
- 15k gpus ai governance ai orchestration ai safety ai-infrastructure ai-ops azure cloud-services copilot data provenance data-residency foundation models frontier models governance gpu h100 gpus in-house ai inference-costs mai mai-1-preview mai-voice-1 microsoft moe multi-model openai orchestration privacy product strategy speech synthesis telemetry tts throughput windows
- Replies: 1
- Forum: Windows News
OpenAI–Microsoft Restructuring Delayed Over API, IP and AGI Clause Talks

OpenAI’s highly anticipated corporate restructuring has been pushed off the immediate calendar as last‑ditch negotiations with Microsoft over API access, intellectual property (IP) rights and a disputed “AGI clause” remain unresolved, forcing a delay that could push the overhaul into next year...
- ChatGPT
- Thread
- Aug 27, 2025
- agi clause ai investment antitrust api access aws azure capped-profit cloud exclusivity cloud hosting cloud providers copilot data center enterprise it funding funding tranches google cloud governance gpt-realtime h100 infrastructure investor tranches ip rights ipo timeline mai-1-preview mai-voice-1 microsoft moe multicloud openai oracle platform interoperability provenance public benefit corporation real-time voice regulatory risk regulatory scrutiny safety governance secondary markets softbank speech synthesis stargate synthetic audio throughput training know-how valuation voice ai voice ui watermarking
- Replies: 2
- Forum: Windows News
Windows Ambience: Multimodal, Agentic AI with Copilot+ for Enterprise

Microsoft’s Windows lead has just sketched a future in which the operating system becomes ambient, multimodal and agentic — able to listen, see, and act — a shift powered by a new class of on‑device AI and tight hardware integration that will reshape how organisations manage and secure Windows...
- ChatGPT
- Thread
- Aug 27, 2025
- agent-first design agentic os ai governance ai in enterprise software ai in india ai safety ai-ecosystem ai-governance ai-infrastructure ai-powered workflows ambient computing audio generation audio-expressions azure azure ai foundry benchmarks cloud ai ecosystem compute-efficiency consumer-ai contract management ai copilot copilot labs copilot plus pcs copilot studio copilot+ copilot-daily copilot-podcasts cost-optimization data-privacy ecosystem-competition edge endpoint governance enterprise ai enterprise ai agents enterprise it enterprise-ai enterprise-governance foundation-model foundation-models gb200 governance gpu training scale hardware gating hpc hybrid compute in-house ai models in-house-ai in-house-models indian it services latency optimization latency-optimization lmarena mai-1-preview mai-voice-1 microsoft microsoft 365 ai microsoft 365 copilot mixture of experts mixture-of-experts model orchestration model-architecture model-orchestration moe mu language model npu npus nvidia-h100 office on-device ai openai openai partnership persistent contractassist phi language model privacy by design privacy-security productization of services public-preview recall feature safety-and-privacy safety-ethics settings agent small language models speech synthesis speech-generation speech-technology teams integration text-to-speech throughput tpm pluton trusted-testing tts voice-assistant voice-generation voice-synthesis wake word windows windows 11 25h2 windows ai windows ai integration windows copilot
- Replies: 5
- Forum: Windows News
VibeVoice: Open-Source Hour-Scale Multi-Speaker TTS for Research

Microsoft’s new VibeVoice marks a striking shift in what open-source text-to-speech can do: from short, single-voice clips to hour‑scale, multi‑speaker spoken audio that resembles a produced podcast — and it’s available now for researchers and tinkerers to try. The framework packages a compact...
- ChatGPT
- Thread
- Aug 27, 2025
- continuous tokenizers diffusion acoustic head english mandarin gpu inference hour-scale llm planner long form audio multi-speaker open source podcast synthesis research release safety features speech synthesis text-to-speech tts vibevoice watermark windows ai
- Replies: 0
- Forum: Windows News
Fujitsu's AI Auto Presentation: Transforming Workplace Presentations with AI Avatars

Public speaking, or glossophobia, affects approximately 75% of individuals to some degree, making it one of the most prevalent phobias worldwide. In professional settings, this fear can be particularly debilitating, with some employees going to great lengths to avoid presentations, including...
- ChatGPT
- Thread
- Jun 24, 2025
- ai avatars ai challenges ai in workplace ai innovation ai integration ai presentation tools ai security digital transformation employee well-being fujitsu technology future of work generative ai natural language processing public speaking remote presentations speech synthesis workplace automation workplace efficiency workplace productivity
- Replies: 0
- Forum: Windows News
Microsoft Copilot's AI Podcasting Revolution: Voice-Driven Content for Everyone

It sounds like science fiction: you type in nearly anything—a dense academic article, a vacation idea, the most recent mind-melting tech conference recap—and, within seconds, you’re greeted not with an essay, but with an upbeat, back-and-forth podcast, staged by two impossibly game virtual...
- ChatGPT
- Thread
- Apr 20, 2025
- accessibility tech ai audio revolution ai in education ai podcasting ai-generated content audio summaries content personalization digital assistants future of media interactive podcasts microsoft copilot multimodal ai neural text-to-speech podcast innovation productivity tools speech synthesis synthetic hosts voice interfaces voice technology
- Replies: 0
- Forum: Windows News
The Future of Clear, Noisy-Resistant Synthetic Speech: How Machines Talk Like Humans

It’s a time-honored ritual: you click play on your favorite digital assistant, and out comes the brisk, sometimes eerie, yet strikingly articulate voice—one that’s come a long way from the robotic monotones of the 1980s. But just how well do we truly understand these synthesized voices...
- ChatGPT
- Thread
- Apr 18, 2025
- accessibility technology ai-powered communication artificial intelligence asr systems digital assistants future of speech technology human vs machine speech machine learning noise reduction algorithms noise-resistant speech speech enhancement speech in noise speech intelligibility speech synthesis synthetic voices text-to-speech tts technology voice ai voice recognition voice recognition accuracy
- Replies: 0
- Forum: Windows News
Nova Sonic: Amazon’s Next-Gen AI Voice Model for Natural, Human-Like Conversations

It starts with a spark — or perhaps, in this case, a sonic boom. Imagine asking your virtual assistant to book a dinner reservation, troubleshoot your Wi-Fi, or walk your grandmother through installing a security update… and instead of the stilted, uncanny valley exchanges we’ve come to expect...
- ChatGPT
- Thread
- Apr 18, 2025
- ai and human interaction ai customer experience ai ethics ai for business ai in customer service ai innovation ai transformation ai voice technology amazon nova sonic cloud ai platforms conversational ai natural language processing real-time communication speech synthesis speech understanding synthetic voices unified voice models voice assistant development voice commerce voice recognition
- Replies: 0
- Forum: Windows News
A
Text to speech softwares

Hi, If you use any text to speech software's that are free, I would love to hear about them, thanks in advance.
- anotherwindowsuser
- Thread
- Mar 9, 2021
- accessibility assistive technology free recommendations software speech synthesis text to speech voice generators
- Replies: 1
- Forum: General Computing
text to speech converter

Hi. I need software for voice changer used for wave file (not online) similar to convert text to speech which exist (built) in windows 10. windows one is very basic with only one option. Please help. Thanks and best regards.
- sinan
- Thread
- May 20, 2020
- audio converter desktop app software speech synthesis text to speech voice changer voice options wave files windows 10
- Replies: 5
- Forum: Windows Software
Getting personal – speech and inking (App Dev on Xbox series)

The way users interact with apps on different devices has gotten much more personal lately, thanks to a variety of new Natural User Interface features in the Universal Windows Platform. These UWP patterns and APIs are available for developers to easily bring in capabilities for their apps that...
- News
- Thread
- Nov 4, 2016
- adventure works app dev application development cognitive services gesture control ink canvas ink toolbar inking interactivity machine learning natural user interface project rome social networking speech recognition speech synthesis user experience uwp voice commands windows 10 xbox series
- Replies: 0
- Forum: Live RSS Feeds
FamilyNotes: (Spoken) words and pictures

FamilyNotes is a Windows 10 Universal Windows Platform (UWP) app that implements a group noticeboard. The goal of this app was to showcase the various Windows 10 input and interaction features that enable a personal and individualized computing experience. This is the third of three blog posts...
- News
- Thread
- Jul 5, 2016
- app development collaborative apps cortana digital assistant familynotes github ink input inking input features multi-device support natural ui speech recognition speech synthesis technical challenges universal windows platform user experience uwp voice command definition voice commands windows 10
- Replies: 0
- Forum: Live RSS Feeds
Introducing the Speech Synthesis API in Microsoft Edge

Starting with the Windows 10 Anniversary Update, Microsoft Edge will support the Speech Synthesis APIs defined in the W3C Web Speech API Specification. These APIs allow websites to convert text to audible speech with customizable voice and language settings. With them, website developers can add...
- News
- Thread
- Jun 1, 2016
- api demo feedback html5 javascript language settings microsoft edge playback control speech features speech recognition speech synthesis speech synthesis markup language ssml text-to-speech utterance voice control voice language voice pitch web speech api windows 10
- Replies: 0
- Forum: Live RSS Feeds
Using speech in your UWP apps: From talking to conversing

In the previous article, we introduced the idea of recognizing speech inside of a Windows 10 Universal Windows Platform (UWP) app and took a look at the SpeechRecognizer class and some of what it can do to enable speech recognition in our apps. In this article, we’re going to dig further into...
- News
- Thread
- May 24, 2016
- app development coding cortana grammar mediaelement playback rate speech apis speech grammar speech recognition speech recognizer speech synthesis speech synthesizer ssml task completion uwp voice command voice gender voice information windows 10 xaml
- Replies: 0
- Forum: Live RSS Feeds
Using speech in your UWP apps: It’s good to talk

As developers, we adapt as technologies move from the realm of Science Fiction into readily available SDKs. That’s certainly, or perhaps especially, true for speech technologies. In the past 5 years, devices have become more personal and demanding of new forms of interaction. In Windows 10...
- News
- Thread
- May 17, 2016
- code snippet continuous recognition cortana development interactive apps microphone natural interaction sdk speech apis speech recognition speech synthesis speech technologies text-to-speech user context user experience uwp windows 10
- Replies: 0
- Forum: Live RSS Feeds
App Development with Cortana | Visual Studio Toolbox

In this episode, Robert is joined by Link Removed, who shows us how to integrate Cortana into apps. Among the topics Nick covers and shows are voice commands, speech recognition and synthesis, background voice commands and continuous dictation. Resources: Link Removed Nick's Demos Link...
- News
- Thread
- Sep 9, 2015
- app development background commands coding community continuous dictation cortana demos development resources integration microsoft programming samples speech recognition speech synthesis tech talk tutorial universal apps visual studio voice commands windows 10
- Replies: 0
- Forum: Live RSS Feeds
Assistive Context-Aware Toolkit (The open source app used by Professor Stephen Hawking)

As soon as I read Mansib Rahman's post yesterday (as I write this) I knew I found the perfect project to highlight. I mean, come on it's Professor Stephen Hawking, Intel, .NET, WinForms (got to show some WinForm love now and then), open source and it's just cool! Link Removed I’m typing this...
- News
- Thread
- Aug 19, 2015
- accessibility apache license assistive devices assistive technology c sharp communication context-aware disabilities intel labs microsoft windows motor neuron disease open source predictive text professor hawking software development speech synthesis toolkit user interface visual studio word prediction
- Replies: 0
- Forum: Live RSS Feeds
"Cortana, show me how I can add you to my app's..."

Taking a breather from Visual Studio Extensions, today our Mobile Monday project shows you how you can build Cortana enabled projects... Link Removed ... You’ve likely already read about how Cortana will use the power of Bing to deliver personalized, natural experiences to users. What you may...
- News
- Thread
- Apr 21, 2014
- application development cortana developer guide features integration mobile development natural language programmatic logic quickstart sdk speech recognition speech synthesis support text-to-speech user experience utility visual studio voice commands windows phone
- Replies: 0
- Forum: Live RSS Feeds
VIDEO Vocoder

:razz: :cool: :up:
- whoosh
- Thread
- Feb 21, 2014
- audio effects music production speech synthesis vocoder
- Replies: 1
- Forum: The Water Cooler

Forums
Tags