Chi Kim
chikim@mastodon.social

Love music, technology, accessibility! Faculty at Berklee College of Music 👨🏻‍💻🎹🐕‍🦺


Chi Kim

6 days ago (Received 5 days ago)


VibeVoice by Microsoft: a TTS designed for generating expressive, long-form, multi-speaker conversational audio up to 90 minutes #TTS #LLM microsoft.github.io/VibeVoice/


Musharraf

in reply to Chi Kim • 5 days ago
Not impressed. Prosody is very bad. Seems to be trained on a lot of synthetic datasets.

Chi Kim

in reply to Musharraf • 4 days ago
@mush42 I don’t think they were aiming for quality. Their goal seems to be conversational TTS in a long-form, podcast-style format like the one on NotebookLM. There’s CSM from Sesame, but it doesn’t handle long-form well.