You are using an out of date browser. It may not display this or other websites correctly. You should upgrade or use an alternative browser.
latentlm
About this tag
LatentLM is a tag on WindowsForum.com that currently covers Microsoft's VibeVoice-1.5B, an open-source text-to-speech model for research. This model generates long-form, multi-speaker audio with up to 90 minutes of coherent speech and supports four distinct speakers. The tag may expand to include other latent language model topics in AI and machine learning as they appear on the forum.
Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...