Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Improving Speech Synthesis for High Intelligibility under Adverse Conditions

Davis Pan, Brian Heng, Shiufun Cheung, Ed Chang

Cambridge Research Laboratory, Compaq Computer Corporation, One Cambridge Center, Cambridge, MA, USA

We investigate methods of improving the intelligibility of synthetic speech under noisy or low-fidelity acoustic conditions. Techniques explored improve speech in a natural manner, such that training wonít be required for the user to understand the enhanced speech. While the improvements are natural in this respect, the changes arenít limited to creating only speech that is achievable by a human vocal tract. Modifications fall into three broad classes: increasing phoneme amplitude, altering spectral shape, and lengthening phoneme duration. Listening tests conducted in noisy and noise-free conditions demonstrate significant improvements to intelligibility for most of the subject phonemes.


Full Paper

Bibliographic reference.  Pan, Davis / Heng, Brian / Cheung, Shiufun / Chang, Ed (2000): "Improving speech synthesis for high intelligibility under adverse conditions", In ICSLP-2000, vol.1, 721-724.