speech synthesis

Speech synthesis demo

Speech sounds can be minimally specified in terms of a small set of parameters (variables), each of which can be described in terms of how they sound (their auditory characteristics), how they are made (physiological characteristics), or their physical (acoustic) characteristics.

This speech was synthesized in 1971 by Peter Ladefoged on a synthesizer at UCLA. The values of the parameters were a modified version of a set provided by John Holmes.

	AUDITORY	PHYSIOLOGICAL	ACOUSTIC
1	Pitch of voice	Rate of vibration of the vocal folds	Fundamental frequency
2	Overtone pitch (1)	First resonance of the vocal tract	Formant 1 frequency
3	Loudness of overtone (1)		Formant 1 amplitude
4	Overtone pitch (2)	Second resonance of the vocal tract	Formant 2 frequency
5	Loudness of overtone (2)		Formant 2 amplitude
6	Overtone pitch (3)	Third resonance of the vocal tract	Formant 3 frequency
7	Loudness of overtone (3)		Formant 3 amplitude
8	Pitch of noise	Fricative and stop bursts	Center of noise frequency
9	Loudness of noise		Amplitude of noise

The 8 links below demonstrate how speech can be built up using these parameters and additional (fixed) higher formants.

Formant 1 alone (parameters 2 and 3 above)

Formant 2 alone (parameters 4 and 5 above)

Formant 3 alone (parameters 6 and 7 above)

Formants 1, 2 and 3 (parameters 2 - 7 above)

Formants 1, 2 and 3 plus additional overtones (parameters 2 - 7 above plus fixed overtones)

Fricative and burst noises alone (parameters 8 and 9)

Everything except the fundamental frequency (parameters 2-9)

Everything including the fundamental frequency (parameters 1-9 )