You are using an out of date browser. It may not display this or other websites correctly. You should upgrade or use an alternative browser.
speechtech
About this tag
The speechtech tag on WindowsForum.com covers discussions about speech technology, including text-to-speech (TTS) systems. A notable thread highlights Microsoft's VibeVoice-1.5B, an open-source TTS model for research that synthesizes up to 90 minutes of multi-speaker audio with up to four distinct speakers. The model is released with safety controls for research use. This tag is relevant for users interested in speech synthesis, open-source AI models, and Microsoft's contributions to speech technology.
Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...