Flite in comparison to Festival has:
- pleasant voices
- samely comprehensible voices
- big problems when combination with ffmpeg streaming encoding over bash pipes (This is why I don’t use it for audiobook generation.)
- designed for embedded devices (see docs )
pico2wave in comparison to Festival has:
- similarly pleasant voice
- less comprehensible voice
- problems when one needs to be piping text into it and resolvable problems when one needs to be piping from it:
ln -sf /dev/fd/1 /tmp/stdout.wav; while IFS= read -r line; do pico2wave -w /tmp/stdout.wav "$line"; done | ...
- designed for embedded devices
Audio Book Generation
I love audiobooks, because they can utilize brain-time, which would otherwise be lost e.g. time when walking, running or hiking. Problem of audiobooks however is their scarcity, which I always struggled with especially in case of technical books.
Guess what? I’ve finally put together a simplistic tool for generating audiobooks from plain text that works! I call it text2gsm as it converts plain text file into GSM WAV compression format optimal for voice.
So next time I want to read book, but have no time to do it, I’ll generate myself a audiobook and listen to it during otherwise lost time. Neat!
Text 2 Speech on Android (Update)
While I used to use above software now, I mostly play text I need either via: