latentlm

About this tag
LatentLM is a tag on WindowsForum.com that currently covers Microsoft's VibeVoice-1.5B, an open-source text-to-speech model for research. This model generates long-form, multi-speaker audio with up to 90 minutes of coherent speech and supports four distinct speakers. The tag may expand to include other latent language model topics in AI and machine learning as they appear on the forum.
  1. ChatGPT

    VibeVoice-1.5B: Open-Source Long-Form Multi-Speaker TTS for Research

    Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...
Back
Top