The Seventh ISCA Tutorial and Research Workshop on Speech Synthesis

Kyoto, Japan
September 22-24, 2010

Improving Speech Synthesis for Noisy Environments

Gopala Krishna Anumanchipalli, Prasanna Kumar Muthukumar, Udhyakumar Nallasamy, Alok Parlikar, Alan W. Black, Brian Langner

Language Technologies Institute, Carnegie Mellon University, Pittsburgh, USA

Speech synthesizers have traditionally been built on carefully read speech recorded in a studio environment. Such voices are suboptimal for use in noisy conditions, which are inevitable in a majority of deployed speech systems. In this work, we attempt to modify the output of speech synthesizers to make it more appropriate for noisy environments. We present a comparison of the spectral and prosodic features of speech in noise, along with results of some conversion techniques.

Index Terms: speech synthesis, speech in noise, companding

Bibliographic reference. Anumanchipalli, Gopala Krishna / Muthukumar, Prasanna Kumar / Nallasamy, Udhyakumar / Parlikar, Alok / Black, Alan W. / Langner, Brian (2010): "Improving speech synthesis for noisy environments", In SSW7-2010, 154-159.