Skip to content

crispinprojects/formant-synthesizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Formant Speech Synthesizer

This is an example of a formant speech syntheiszer which replicates speech using formants. If you plot amplitude verses frequency of a typical speaker then formants are the resonance points or spectral peaks in the spectral envelop. They are the harmonics that are accentuated more than the other frequencies. Formant is a word derived from the Latin formare, meaning "to shape" in this case the sounds of vocal track.

The code generates formant frames from a set of phonetic transcription codes known as ARPABET for a Klatt synthesizer. A Klatt Synthesizer is a software synthesizer for generating speech designed by Dennis Klatt in 1980. With the help of Peter Millers Recite project I was able generate frames of Klatt parameters with each frame consisting of forty parameters describing formant frequencies, amplitudes, bandwidths, voicing, frication etc. This required code to model vowels, sonorants, fricatives and plosives. The generated frames were used to produce speech using an implementation of the Klatt Cascade-Parallel Formant Speech Synthesizer developed by Reece H. Dunn and found here on github.

Compile

Use the MAKEFILE to compile. The target is "fspeak"

make fspeak
./fspeak

You should hear the formant speech synthesizer saying "Hello world I am a speech synthesizer"

Summary

Unfortunately, the synthesizer produces robotic speech. Compile the code and listen to it say "Hello world I am a speech synthesizer"

I was hoping to use the formant speech synthesizer in my GTK4 Talk Calendar project but I use my own word concatenation speech synthesizer instead for audibility.