Hello Fediverse,
We are looking for Text-To-Speak (TTS) expertise to help or advise us on improving the default voice of the Linux desktop. 📣
Please reach out or boost
Thanks!
#Linux #tts #accessibility #a11y #GNOME #KDE #FreeSoftware #freedesktop #ml
This entry was edited (1 year ago)
reshared this
Lukáš Tyrychtr
in reply to Sonny • • •Peter Vágner likes this.
Lukáš Tyrychtr
Unknown parent • • •Sonny
in reply to Lukáš Tyrychtr • • •@tyrylu @fireborn yeah the more feedback I get the more I being to wonder if what we need isn't an easy way to discover and install speech synthesizers.
I would still like to have a better default though.
Sonny
Unknown parent • • •Lukáš Tyrychtr
Unknown parent • • •Musharraf
in reply to Sonny • • •Sonata provides TTS through C-library, command line app, GRPC server, and Python bindings.
I'm optimizing Sonata for use in low-resource, high responsiveness scenarios, such as screen reader usage.
An Android app that uses Sonata is currently being developed and will be released soon.
I'm very interested to know what I can offer.
Repo: github.com/mush42/sonata
GitHub - mush42/sonata: A cross-platform engine for neural TTS models.
GitHubPeter Vágner likes this.
Peter Vágner reshared this.
Peter Vágner
in reply to Musharraf • •Musharraf
in reply to Peter Vágner • • •Yes. Existing onnx models work fine.
You can also export the existing checkpoints using a different script for streaming speech in realtime.
Peter Vágner likes this.
Peter Vágner
in reply to Musharraf • •Musharraf
in reply to Peter Vágner • • •PR pending:
github.com/rhasspy/piper/pull/…
ONNX streaming support by mush42 · Pull Request #255 · rhasspy/piper
GitHubMusharraf
in reply to Sonny • • •rhasspy.github.io/piper-sample…
Piper Voice Samples
rhasspy.github.ioEitan
in reply to Sonny • • •I'm working on a D-Bus based spec and client library that would supersede the current platform APIs. Need to blog/publicize/socialize, but would love to talk about it at some point.
eeejay.github.io/libspiel/
Spiel-0.1
eeejay.github.ioPeter Vágner likes this.
Peter Vágner reshared this.
Eitan
in reply to Eitan • • •Eitan
in reply to Eitan • • •Patrick W
in reply to Sonny • • •Sonny
in reply to Patrick W • • •Notes on synthetic speech - Tink - Léonie Watson
Tink - Léonie Watson - On technology, food & life in the digital ageSonny
in reply to Lukáš Tyrychtr • • •Thanks for the feedback. We will look into making discovery/install/update/comparison of synthesizers more accessible.
Can you help me understand why you think espeak should remain as default?
From my side, I would like to encourage as much as possible developers to test their GUI with the screen reader.
I believe the default espeak voice is off-putting.
Stephan
in reply to Sonny • • •Andriy Utkin
in reply to Sonny • • •Peter Vágner likes this.
alcinnz
in reply to Sonny • • •I'm curious to see what comes out of this!
I don't have any idea how to improve the core code, but I have played with eSpeak & rendered HTML/CSS to SSML.
I notice that there's a sharp distinction between the voices which sound natural vs the ones which give me more knobs to take advantage of the medium. I'd like to see that remedied!
davidak
in reply to Sonny • • •here is my research on existing projects: pad.nixnet.services/s/0qeHhUC1… (a bit out of date)
i think espeak-ng is outdated technology. the quality is not acceptable
the best fully open source tts i know is Coqui TTS, but the company is shutting down. maybe you could still contract the people who also worked at mozilla before on the same project
github.com/coqui-ai/TTS
info@coqui.ai
GitHub - coqui-ai/TTS: 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
GitHubSonny
in reply to Sonny • • •We can contract BTW :)
Some examples of what we're interested in
• The state of speech synthesis on the Linux desktop and the various solutions
• What would it take to improve espeak voice
• How well machine learning solutions could work locally, specially in relationship to battery life and older hardware
Tuxicoman
in reply to Sonny • • •For STT, so the opposite, I found #vosk to be OK.
Kathy Reid
in reply to Sonny • • •