Chi Kim 1 month ago • • Chi Kim 1 month ago • • VibeVoice by Microsoft: a TTS designed for generating expressive, long-form, multi-speaker conversational audio up to 90 minutes #TTS #LLM microsoft.github.io/VibeVoice/VibeVoicemicrosoft.github.io #llm #tts Languages Search Text Share via ...
in reply to Chi Kim Musharraf in reply to Chi Kim • 1 month ago • • Not impressed. Prosody is very bad. Seams to be trained on a lot of synthetic datasets. Languages Search Text Share via ...
Musharraf
in reply to Chi Kim • • •