Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...
Title speaks for itself. Check this out:
http://windows.microsoft.com/en-CA/windows/downloads/diffusion-theme
The theme has 13 different backgrounds, and they're amazing.