The big question ahead: how hard would it be to decouple from eSpeak and build our own text→phoneme layer?
The answer: non-trivial but very doable, especially for languages like Hungarian, where spelling is largely regular and stress falls predictably on the first syllable.
The hard part isn’t DSP. The engine already works.
The real work is linguistic: normalization, phoneme rules, edge cases, and a lot of listening and iteration.
It’s the kind of challenge that rewards patience more than clever tricks — and honestly, that’s part of the appeal.
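
To make the "linguistic work" concrete, here is a minimal sketch of what a rule-based text→phoneme pass could look like for Hungarian. It is not eSpeak's approach or anyone's actual implementation: the `RULES` table and the `to_phonemes` function are hypothetical names, the table covers only a handful of letters, and unlisted letters are passed through unchanged. It normalizes the input to NFC, then applies greedy longest-match so that digraphs like `sz` and `cs` win over single letters.

```python
import unicodedata

# Illustrative, deliberately incomplete subset of Hungarian spelling-to-IPA rules.
# (Assumption: a real rule set would be much larger and context-sensitive.)
RULES = {
    "dzs": "dʒ",
    "cs": "tʃ", "sz": "s", "zs": "ʒ", "gy": "ɟ", "ny": "ɲ", "ty": "c", "ly": "j",
    "a": "ɒ", "á": "aː", "e": "ɛ", "é": "eː", "i": "i", "í": "iː",
    "o": "o", "ó": "oː", "ö": "ø", "ő": "øː",
    "u": "u", "ú": "uː", "ü": "y", "ű": "yː",
    "c": "ts", "s": "ʃ",
}
MAX_LEN = max(len(key) for key in RULES)


def to_phonemes(word: str) -> str:
    """Convert one word to a rough IPA string using greedy longest-match."""
    # Normalization step: compose accents (o + combining double acute -> ő)
    # and lowercase, so the rule table only needs one form of each letter.
    word = unicodedata.normalize("NFC", word).lower()
    if not word:
        return ""
    out = []
    i = 0
    while i < len(word):
        # Try the longest candidate first so "sz" is not read as "s" + "z".
        for n in range(MAX_LEN, 0, -1):
            chunk = word[i:i + n]
            if chunk in RULES:
                out.append(RULES[chunk])
                i += len(chunk)
                break
        else:
            # No rule matched: pass the letter through unchanged (b, d, k, ...).
            out.append(word[i])
            i += 1
    # Stress: always mark the first syllable.
    return "ˈ" + "".join(out)


print(to_phonemes("magyar"))
print(to_phonemes("szia"))
```

The edge cases show up immediately. In a compound like "község" (köz + ség), the letters z and s belong to different morphemes, but this sketch would read them as the digraph `zs`. Catching cases like that takes exception lists or morphological hints, which is where the listening and iteration come in.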
