ISCA Archive Interspeech 2013

Lombard modified text-to-speech synthesis for improved intelligibility: submission for the hurricane challenge 2013

Antti Suni, Reima Karhila, Tuomo Raitio, Mikko Kurimo, Martti Vainio, Paavo Alku

This paper describes the modification of a TTS system to improve the intelligibility of speech in various noise conditions. First, the GlottHMM vocoder is used to train a voice with modal speech data. The vocoder and voice parameters are then modified to mimic the properties of the Lombard effect, based on a small amount of Lombard speech from the same speaker. More specifically, the durations are increased, the fundamental frequency is raised, the spectral tilt is decreased, the harmonic-to-noise ratio is increased, and pressed glottal flow pulses are used in creating the excitation. The formants of the speech are also enhanced, and finally the speech is compressed in order to increase the noise robustness of the voice. The evaluation results of the Hurricane Challenge 2013 indicate that the modified voice is mostly less intelligible than the unmodified natural speech, as expected, but more intelligible than the reference TTS voice, especially in low-SNR conditions.
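A few of the modifications listed in the abstract can be illustrated with a minimal sketch. This is not the GlottHMM implementation, which operates on vocoder and voice model parameters; it is a generic illustration, and every function name and default value below (the duration factor, the semitone shift, the pre-emphasis coefficient, the compressor threshold and ratio) is an assumption chosen for the example rather than a setting from the paper. The harmonic-to-noise ratio change, the pressed glottal pulses, and the formant enhancement are omitted.

```python
import math


def scale_durations(frame_counts, factor=1.2):
    """Lengthen each duration (in frames) by a constant, hypothetical factor."""
    return [max(1, round(n * factor)) for n in frame_counts]


def raise_f0(f0_hz, semitones=2.0):
    """Shift voiced F0 values upward; 0.0 marks an unvoiced frame and is kept."""
    ratio = 2.0 ** (semitones / 12.0)
    return [f * ratio if f > 0.0 else 0.0 for f in f0_hz]


def flatten_tilt(samples, coeff=0.5):
    """First-order pre-emphasis, y[n] = x[n] - coeff * x[n-1].

    This attenuates low frequencies relative to high ones, which is one
    simple way to reduce spectral tilt (an assumed stand-in, not the
    paper's method).
    """
    out = []
    prev = 0.0
    for x in samples:
        out.append(x - coeff * prev)
        prev = x
    return out


def compress(samples, threshold=0.3, ratio=4.0):
    """Static compressor: the part of each magnitude above the threshold is divided by ratio."""
    out = []
    for x in samples:
        mag = abs(x)
        if mag > threshold:
            mag = threshold + (mag - threshold) / ratio
        out.append(math.copysign(mag, x))
    return out
```

In this sketch the first two functions act on parameter tracks (durations and F0), while the last two act on waveform samples, loosely mirroring the abstract's order in which compression is applied last.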


doi: 10.21437/Interspeech.2013-766

Cite as: Suni, A., Karhila, R., Raitio, T., Kurimo, M., Vainio, M., Alku, P. (2013) Lombard modified text-to-speech synthesis for improved intelligibility: submission for the hurricane challenge 2013. Proc. Interspeech 2013, 3562-3566, doi: 10.21437/Interspeech.2013-766

@inproceedings{suni13_interspeech,
  author={Antti Suni and Reima Karhila and Tuomo Raitio and Mikko Kurimo and Martti Vainio and Paavo Alku},
  title={{Lombard modified text-to-speech synthesis for improved intelligibility: submission for the hurricane challenge 2013}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={3562--3566},
  doi={10.21437/Interspeech.2013-766}
}