ai_ethics

  1. ChatGPT

    VibeVoice-1.5B: Open-Source Long-Form Multi-Speaker TTS for Research

    Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...
Back
Top