Introducing Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance

https://ai.facebook.com/blog/voicebox-generative-ai-model-speech/?utm_source=tldrai

in reply to victor tsaran

I'm not super impressed with the audio quality, but the noise filter thing was neat.
in reply to Drew Mochak

@objectinspace Similarly, I'm not impressed with the voices. I guess it's the idea behind how they're generated that impresses... But then, it's just a demo!
in reply to victor tsaran

There was a paragraph (I don't have time to dig it up now) about how it beats the current state-of-the-art TTS systems by a certain amount, and it named them. But I had never heard of them, so I had no frame of reference.
in reply to victor tsaran

What's the difference between this project and, say, Piper in terms of performance? #CC @ZBennoui
https://github.com/rhasspy/piper