GitHub - gojiplus/text_to_audio_book: Uses Google Text to Speech to Generate AudioBooks from Text

Text to Audio (Books)

Audio is great. You can listen to things when commuting, doing your household chores, on the beach---wherever you please.

Some of the things you want to listen to are things where the text is not under copyright, for instance, everything published in the U.S. before 1923 is not copyrighted, or where you have bought the electronic text. And now that the speech synthesis is good enough, it doesn't make sense to buy an audio book of something that you can very cheaply create using the Google Speech Synthesis API.

This simple script lowers the bar for producing audio from text yet further by wrapping the process. It takes a text file (or a SSML file where available) as input along with a parameter for which voice you want to use and produces a mp3.

Functionality

Google speech synthesis API breaks the document into 5k character chunks, automatically truncating at word boundaries and adding a 200ms lag between two 5k character chunks. We chunk by sentence, getting as many full sentences as possible within the 5k limit. For plays, e.g., here (Pride and Prejudice), we also take a separate character, gender file that lists the gender of each character, e.g., here for Pride and Prejudice, so that we can switch between male and female voices.

Prerequisites

pydub need either ffmpeg or libav to support MP3. Please look at getting ffmpeg set up

Running the Script

pip install -r requirements.txt
python text_to_audio.py -o sample_audio.mp3 sample_text.txt

Illustration

Project Gutenberg provides text of books in various formats for which the copyright has lapsed. We illustrate the use of the script, we use it to get the mp3 for Jane Austen's Pride and Prejudice. (We don't use plain text as there are a lot of places where there is 'underscore' etc.)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
character_gender_pap.csv		character_gender_pap.csv
requirements.txt		requirements.txt
text_to_audio.py		text_to_audio.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

character_gender_pap.csv

character_gender_pap.csv

requirements.txt

requirements.txt

text_to_audio.py

text_to_audio.py

Repository files navigation

Text to Audio (Books)

Functionality

Prerequisites

Running the Script

Illustration

About

Releases

Packages

Contributors 2

Languages

gojiplus/text_to_audio_book

Folders and files

Latest commit

History

Repository files navigation

Text to Audio (Books)

Functionality

Prerequisites

Running the Script

Illustration

About

Resources

Stars

Watchers

Forks

Languages