For the past couple of months I've been working on Pied (pied.mikeasoft.com), an application that makes it easy to use modern, natural sounding, text-to-speech voices on Linux. It does this by integrating the Piper neural text-to-speech engine with speech-dispatcher, so most existing software will work with it out of the box.
The first beta version is now available in the snap store: snapcraft.io/pied
And available as a Flatpak from pied.mikeasoft.com
(Other package formats will follow)
I'd appreciate any feedback if you're able to test it, thanks!
#TTS #accessibility #ScreenReader #linux #TextToSpeech
This entry was edited (1 year ago)
Musharraf
in reply to Mike Sheldon • • •Nice work with Pied.
FWIW: I'm developing Sonata, a frontend for Piper (and other TTS models).
Recently, I implemented streaming support in Piper models:
PR: github.com/rhasspy/piper/pull/…
This feature is currently supported in Sonata, and it improves latency considerably.
In my Win 11 PC, it takes less than 170 seconds to generate the first chunk of audio.
Sonata repo: github.com/mush42/sonata
GitHub - mush42/sonata: A cross-platform engine for neural TTS models.
GitHub