New beta release of Sonata-for-NVDA, formerly known as Piper-for-NVDA
NVDA 2024.1 compatibility
Support for fast variants of Piper voices. These fast variants improve responsiveness significantly because they use streaming synthesis
Improvements to responsiveness and speed across the board
Release page:
github.com/mush42/sonata-nvda/…
Direct download link:
github.com/mush42/sonata-nvda/…
Release v3.0-beta.1 · mush42/sonata-nvda
What's new: NVDA 2024.1 compatibility. Changed the name to Sonata since we plan to support additional TTS models besides Piper in the future. Support for fast variants for Piper voices. These fast v...
Musharraf
in reply to Musharraf • • •Tamas G
in reply to Musharraf • • •Musharraf
in reply to Tamas G • • •If your existing voices are trained with Piper, then they'll work with this version.
If they fail to work for any reason, you can copy the config from any working voice to your voice and edit the relevant values.
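Copying a working voice's config and editing it could look roughly like the sketch below. It assumes the voice config is a JSON file, as with Piper voices. The function names are hypothetical, and any keys you override (for example a sample rate under `audio`) are illustrative assumptions, so check them against your own voice's config.

```python
import json
from pathlib import Path


def _deep_update(base, changes):
    """Recursively merge `changes` into `base` in place."""
    for key, value in changes.items():
        if isinstance(value, dict) and isinstance(base.get(key), dict):
            _deep_update(base[key], value)
        else:
            base[key] = value


def copy_config_with_overrides(working_config, target_config, overrides):
    """Copy a working voice's JSON config, apply overrides, and save it."""
    config = json.loads(Path(working_config).read_text(encoding="utf-8"))
    _deep_update(config, overrides)
    Path(target_config).write_text(
        json.dumps(config, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return config
```

Only the values you pass in `overrides` change; everything else is carried over from the working voice.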
Tamas G
in reply to Musharraf • • •Devin Prater :blind:
in reply to Tamas G • • •Musharraf
in reply to Tamas G • • •@Tamasg
If you have the original checkpoint, you can convert it to the new format.
Take a look at this script, which I used to export Piper's checkpoints:
github.com/mush42/piper-rt-mak…
I need to update the docs, and add a section on training voices.
piper-rt-maker/tasks.py at main · mush42/piper-rt-maker
Tamas G
in reply to Musharraf • • •Musharraf
in reply to Tamas G • • •Here are the steps to convert the checkpoint to the fast format:
# Clone piper fork containing export code
git clone https://github.com/mush42/piper
cd ./piper
# Checkout streaming branch
git checkout streaming
cd ./src/python
pip3 install -r requirements.txt
# Upgrade torch
pip3 install --upgrade torch pytorch-lightning onnx
source ./build_monotonic_align.sh
# Export. Edit paths
python3 -m piper_train.export_onnx_streaming --debug [checkpoint path] [export directory]
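If you have several checkpoints, the export step can be scripted. This is only a sketch: it wraps the same `piper_train.export_onnx_streaming` command as the steps above, the helper names are hypothetical, and it assumes you run it from the `src/python` directory of the streaming branch. Creating the export directory up front is a precaution, not something the exporter is known to require.

```python
import subprocess
import sys
from pathlib import Path


def build_export_command(checkpoint, export_dir, python=sys.executable):
    """Return the argv list for the streaming export command."""
    return [
        python, "-m", "piper_train.export_onnx_streaming",
        "--debug", str(checkpoint), str(export_dir),
    ]


def export_all(checkpoints, export_root):
    """Export each checkpoint into its own sub-directory of export_root."""
    for ckpt in map(Path, checkpoints):
        out_dir = Path(export_root) / ckpt.stem
        out_dir.mkdir(parents=True, exist_ok=True)
        # check=True stops at the first failed export.
        subprocess.run(build_export_command(ckpt, out_dir), check=True)
```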
GitHub - mush42/piper: A fast, local neural text to speech system
Tamas G
in reply to Musharraf • • •Tamas G
in reply to Musharraf • • •Musharraf
in reply to Tamas G • • •I didn't edit that script; it came from the Piper repo.
Anyway, it does not affect the installation. I encountered it myself when exporting voices.
Tamas G
in reply to Musharraf • • •Tamas G
in reply to Musharraf • • •Tom Grant
in reply to Tamas G • • •Musharraf
in reply to Musharraf • • •After installing this version, you will lose all of your installed voices. Please use the voice manager to reinstall them.
Aryan
in reply to Musharraf • • •Nick Giannak III
in reply to Musharraf • • •Musharraf
in reply to Nick Giannak III • • •A dataset designed specifically for screen reader usage goes a long way toward creating a good-quality voice.
If guidelines are the issue, we can come up with a set of guidelines based on the Microsoft/Google guidelines, which are openly available.
Nick Giannak III
in reply to Musharraf • • •Timothy Wynn
in reply to Musharraf • • •AttributeError: 'WinmmWavePlayer' object has no attribute 'setVolume'
Pratik Patel
in reply to Timothy Wynn • • •Musharraf
in reply to Pratik Patel • • •Which voices?
Custom voices or the ones downloaded from the voice manager?
I'd appreciate it if you can provide NVDA logs.
Pratik Patel
in reply to Musharraf • • •Pratik Patel
in reply to Musharraf • • •After updating to the latest beta, the issue I reported still exists. I removed all voices, uninstalled the add-on, reinstalled it, and added voices again. Here's a link to the log.
dropbox.com/scl/fi/914e332qia2…
Piper.log
Musharraf
in reply to Pratik Patel • • •It seems like the server is not running.
Are you running NVDA on a 32-bit or ARM64 machine? Sonata only works on 64-bit versions of Windows.
Otherwise, check if the server generated any logs in the following file path:
[NVDA config directory]\sonata\logs\sonata-grpc.log
If not, try running the following binary from a cmd window and report the output:
[NVDA config directory]\addons\sonata_neural_voices\synthDrivers\sonata_neural_voices\bin\sonata-grpc.exe
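The two checks above can be scripted. A minimal sketch, assuming you pass in your own NVDA config directory (for installed copies this is usually `%APPDATA%\nvda`, but verify on your system). The function names are made up for illustration; the relative paths are the ones given above.

```python
import os
from pathlib import PureWindowsPath


def sonata_paths(nvda_config_dir):
    """Build the log and server binary paths under the NVDA config directory."""
    base = PureWindowsPath(nvda_config_dir)
    return {
        "log": base / "sonata" / "logs" / "sonata-grpc.log",
        "server": base / "addons" / "sonata_neural_voices" / "synthDrivers"
        / "sonata_neural_voices" / "bin" / "sonata-grpc.exe",
    }


def report(nvda_config_dir):
    """Return one line per path saying whether it exists on this machine."""
    lines = []
    for name, path in sonata_paths(nvda_config_dir).items():
        status = "found" if os.path.exists(str(path)) else "missing"
        lines.append(f"{name}: {path} ({status})")
    return lines
```

On Windows, `print("\n".join(report(os.path.expandvars(r"%APPDATA%\nvda"))))` would show whether the log and the binary are where the add-on expects them.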
Pratik Patel
in reply to Musharraf • • •Thanks for trying to troubleshoot this. I'm running this on 64-bit Windows on an Intel machine, not ARM. The log file is not generated. Trying to run sonata-grpc.exe from the bin directory results in the following message:
The term 'sonata-grpc.exe' is not recognized as the name of a cmdlet, function, script file, or operable program.
Pratik Patel
in reply to Musharraf • • •I ran it as "./sonata-grpc.exe" and it gave me
"Starting sonata-grpc serverr at 127.0.0.1:49314"
Musharraf
in reply to Pratik Patel • • •Maybe send me the NVDA log so I can diagnose why the TTS server isn't running.
Pratik Patel
in reply to Musharraf • • •Here is the most recent log.
dropbox.com/scl/fi/h3bfsprt1q5…
Piper2.log
Pratik Patel
in reply to Musharraf • • •Peter Vágner
in reply to Musharraf • •Thanks for all the fantastic work you are putting into this.
Musharraf
in reply to Peter Vágner • • •Here's how to build the sonata-grpc binary:
git clone https://github.com/mush42/sonata
cd ./sonata/sonata-grpc
# With Rust installed
cargo build --release
GitHub - mush42/sonata: A cross-platform engine for neural TTS models.
Musharraf
in reply to Peter Vágner • • •If you just want to set the eSpeak-ng data directory, you don't need to re-build the binary.
Just set the following environment variable before launching sonata-grpc:
SONATA_ESPEAKNG_DATA_DIRECTORY=[your custom espeak-data directory parent]
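If you launch the server from a script, the variable can be set for that one process only. A rough sketch, assuming you start `sonata-grpc` yourself; the helper names are hypothetical, and only the variable name comes from the instructions above.

```python
import os
import subprocess


def espeak_env(data_dir_parent, base_env=None):
    """Return a copy of the environment with the eSpeak-ng data directory set."""
    env = dict(os.environ if base_env is None else base_env)
    env["SONATA_ESPEAKNG_DATA_DIRECTORY"] = str(data_dir_parent)
    return env


def launch_sonata_grpc(binary_path, data_dir_parent):
    """Start sonata-grpc with the custom data directory; returns the process."""
    return subprocess.Popen([str(binary_path)], env=espeak_env(data_dir_parent))
```

Because the environment is copied, the setting does not leak into the parent shell or other programs.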
Peter Vágner
in reply to Musharraf • •Tom Grant
Unknown parent • • •Tamas G
Unknown parent • • •Tamas G
Unknown parent • • •Tamas G
Unknown parent • • •Andre Louis
in reply to Tamas G • • •Just downloaded this myself. Have you updated Keynote in recent times to take advantage of this new AddOn? That's one I'm very keen on trying now that my machine can handle them again, after the addon rewrite. Thanks.
@fireborn @TomGrant91 @mush42
Tamas G
in reply to Andre Louis • • •Andre Louis
in reply to Tamas G • • •Tamas G
in reply to Andre Louis • • •JamminJerry
in reply to Tamas G • • •Andre Louis
in reply to Tamas G • • •Musharraf
in reply to JamminJerry • • •If you provide the logs, I'll be able to diagnose the issue.
JamminJerry
in reply to Musharraf • • •JamminJerry
in reply to Musharraf • • •JamminJerry
in reply to Musharraf • • •Musharraf
in reply to JamminJerry • • •Just send the NVDA log.
JamminJerry
in reply to Musharraf • • •JamminJerry
in reply to Musharraf • • •Musharraf
in reply to JamminJerry • • •An easier way is to press insert+F1, then select all and copy.
You can paste it in a plain text file, save and send it.
JamminJerry
in reply to Musharraf • • •dropbox.com/scl/fi/66sx9tsqvxl…
Andre Louis
in reply to Musharraf • • •