Hello Fediverse,
We are looking for Text-To-Speak (TTS) expertise to help or advise us on improving the default voice of the Linux desktop. 📣
Please reach out or boost
Thanks!
#Linux #tts #accessibility #a11y #GNOME #KDE #FreeSoftware #freedesktop #ml
This entry was edited (11 months ago)
reshared this
Lukáš Tyrychtr
in reply to Sonny • • •Peter Vágner likes this.
Lukáš Tyrychtr
Unknown parent • • •Sonny
in reply to Lukáš Tyrychtr • • •@tyrylu @fireborn yeah the more feedback I get the more I being to wonder if what we need isn't an easy way to discover and install speech synthesizers.
I would still like to have a better default though.
Sonny
Unknown parent • • •Lukáš Tyrychtr
Unknown parent • • •Musharraf
in reply to Sonny • • •Sonata provides TTS through C-library, command line app, GRPC server, and Python bindings.
I'm optimizing Sonata for use in low-resource, high responsiveness scenarios, such as screen reader usage.
An Android app that uses Sonata is currently being developed and will be released soon.
I'm very interested to know what I can offer.
Repo: github.com/mush42/sonata
GitHub - mush42/sonata: A cross-platform engine for neural TTS models.
GitHubPeter Vágner likes this.
Peter Vágner reshared this.
Peter Vágner
in reply to Musharraf • •Musharraf
in reply to Peter Vágner • • •Yes. Existing onnx models work fine.
You can also export the existing checkpoints using a different script for streaming speech in realtime.
Peter Vágner likes this.
Peter Vágner
in reply to Musharraf • •Musharraf
in reply to Peter Vágner • • •PR pending:
github.com/rhasspy/piper/pull/…
ONNX streaming support by mush42 · Pull Request #255 · rhasspy/piper
GitHubMusharraf
in reply to Sonny • • •rhasspy.github.io/piper-sample…
Piper Voice Samples
rhasspy.github.ioEitan
in reply to Sonny • • •I'm working on a D-Bus based spec and client library that would supersede the current platform APIs. Need to blog/publicize/socialize, but would love to talk about it at some point.
eeejay.github.io/libspiel/
Spiel-0.1
eeejay.github.ioPeter Vágner likes this.
Peter Vágner reshared this.
Eitan
in reply to Eitan • • •Eitan
in reply to Eitan • • •Patrick W
in reply to Sonny • • •Sonny
in reply to Patrick W • • •Notes on synthetic speech - Tink - Léonie Watson
Tink - Léonie Watson - On technology, food & life in the digital ageSonny
in reply to Lukáš Tyrychtr • • •Thanks for the feedback. We will look into making discovery/install/update/comparison of synthesizers more accessible.
Can you help me understand why you think espeak should remain as default?
From my side, I would like to encourage as much as possible developers to test their GUI with the screen reader.
I believe the default espeak voice is off-putting.
Stephan
in reply to Sonny • • •Andriy Utkin
in reply to Sonny • • •Peter Vágner likes this.
alcinnz
in reply to Sonny • • •I'm curious to see what comes out of this!
I don't have any idea how to improve the core code, but I have played with eSpeak & rendered HTML/CSS to SSML.
I notice that there's a sharp distinction between the voices which sound natural vs the ones which give me more knobs to take advantage of the medium. I'd like to see that remedied!
Bri😻
in reply to Sonny • • •davidak
in reply to Sonny • • •here is my research on existing projects: pad.nixnet.services/s/0qeHhUC1… (a bit out of date)
i think espeak-ng is outdated technology. the quality is not acceptable
the best fully open source tts i know is Coqui TTS, but the company is shutting down. maybe you could still contract the people who also worked at mozilla before on the same project
github.com/coqui-ai/TTS
info@coqui.ai
GitHub - coqui-ai/TTS: 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
GitHubSonny
in reply to Sonny • • •We can contract BTW :)
Some examples of what we're interested in
• The state of speech synthesis on the Linux desktop and the various solutions
• What would it take to improve espeak voice
• How well machine learning solutions could work locally, specially in relationship to battery life and older hardware
Tuxicoman
in reply to Sonny • • •For STT, so the opposite, I found #vosk to be OK.
Kathy Reid
in reply to Sonny • • •