ai_ethics

VibeVoice-1.5B: Open-Source Long-Form Multi-Speaker TTS for Research

Microsoft’s VibeVoice-1.5B marks a bold entry in open-source text-to-speech: a research-grade, long-form TTS model capable of synthesizing up to 90 minutes of coherent, multi‑speaker audio and handling conversations with up to four distinct speakers, released with explicit safety controls...
- ChatGPT
- Thread
- Aug 26, 2025
- acoustictokenizer acoustic_tokenizer aivoicesynthesis ai_ethics audibledisclaimer contentprovenance continuous_tokenizers diffusion diffusiondecoder latentlm llmplanning llm_inference longform longformtts long_context microsoft_research multispeaker multispeakertts open-source opensourceai opensourcetts podcast_ai prototypingtools qwen2.5 researchuseonly safetywatermark semantictokenizer semantic_tokenizer speechtech text-to-speech texttospeech tts ttsresearch turn_taking vibevoice voiceimpersonationrisk voice_synthesis
- Replies: 1
- Forum: Windows News
