You are using an out of date browser. It may not display this or other websites correctly. You should upgrade or use an alternative browser.
ttsresearch
About this tag
The ttsresearch tag on WindowsForum.com covers discussions about text-to-speech (TTS) models intended for research purposes. A featured thread highlights Microsoft's VibeVoice-1.5B, an open-source TTS model capable of synthesizing up to 90 minutes of coherent, multi-speaker audio with up to four distinct speakers. The model is released with explicit safety controls and is positioned for research use. Topics under this tag include long-form TTS, multi-speaker synthesis, and open-source AI models for speech generation, with a focus on experimental and academic applications rather than consumer deployment.
Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...