Vaclav Kosar's face photo
Vaclav Kosar
Software, Machine Learning, & Business

Linux Text To Speech Comparison: Flite Vs Pico2Wave Vs Festival

Comparison of open-source text to speech (TTS) software in terms of pleasantness, comprehensibility, and modularity.

Flite in comparison to Festival has:

  • pleasant voices
  • samely comprehensible voices
  • big problems when combination with ffmpeg streaming encoding over bash pipes (This is why I don’t use it for audiobook generation.)
  • designed for embedded devices (see docs )

pico2wave in comparison to Festival has:

  • similarly pleasant voice
  • less comprehensible voice
  • problems when one needs to be piping text into it and resolvable problems when one needs to be piping from it: ln -sf /dev/fd/1 /tmp/stdout.wav; while IFS= read -r line; do pico2wave -w /tmp/stdout.wav "$line"; done | ...
  • designed for embedded devices

Audio Book Generation

I love audiobooks, because they can utilize brain-time, which would otherwise be lost e.g. time when walking, running or hiking. Problem of audiobooks however is their scarcity, which I always struggled with especially in case of technical books.

Guess what? I’ve finally put together a simplistic tool for generating audiobooks from plain text that works! I call it text2gsm as it converts plain text file into GSM WAV compression format optimal for voice.

So next time I want to read book, but have no time to do it, I’ll generate myself a audiobook and listen to it during otherwise lost time. Neat!

Text 2 Speech on Android (Update)

While I used to use above software now, I mostly play text I need either via:

Created on 26 Dec 2016.

Let's connect

Privacy Policy How many days left in this quarter? Twitter Bullet Points to Copy & Paste