You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

r_claypool comments on The Sequences in MP3 Format - Less Wrong Discussion

12 Post author: r_claypool 08 July 2011 07:40PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (18)

You are viewing a single comment's thread. Show more comments above.

Comment author: r_claypool 01 August 2011 06:00:02AM *  0 points [-]

I have price quotes for Acapela, Cepstral, Wizzard (AT&T Voices), Neospeech, and Nuance RealSpeak. The range is from $1,000 to $15,000 USD.

Open source options are eSpeak (robotic), Festival (robotic), FreeTTS (robotic), Pico and others.

Pico is part of Android and it sounds more natural than other open source options I tried. Pico is licensed under Apache 2.0. Here's a demo.

The commercial voices are definately better; Loquendo is a good example.

So now I can start converting via Pico or try to get funding for a more natural voice. Thoughts?

Comment author: wedrifid 01 August 2011 09:20:14AM 0 points [-]

So now I can start converting via Pico or try to get funding for a more natural voice. Thoughts?

Start with pico I guess. Then we can possibly upgrade in the future.