You are using an out of date browser. It may not display this or other websites correctly. You should upgrade or use an alternative browser.
multispeakertts
About this tag
The multispeakertts tag on WindowsForum covers discussions about multi-speaker text-to-speech technology, with a focus on Microsoft's VibeVoice-1.5B model. This open-source TTS system is designed for research use, capable of synthesizing up to 90 minutes of coherent audio with up to four distinct speakers. Topics include long-form conversational speech generation, safety controls, and the model's role as a frontier in open-source TTS. The tag is relevant for researchers and developers exploring multi-speaker TTS models and their applications.
Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...