International Conference on Auditory-Visual Speech Processing 2008

Tangalooma Wild Dolphin Resort, Moreton Island, Queensland, Australia
September 26-29, 2008

May speech modifications in noise contribute to enhance audio-visible cues to segment perception?

Maëva Garnier

IJLRA-LAM (Université Paris 6, CNRS, Ministère de la Culture) and
Music Acoustics Group-School of Physics (University of New South Wales, Sydney)

In this study we explore how acoustic and lip articulatory characteristics of bilabial consonants and three extreme French vowels vary in Lombard speech. In the light of several theories of segments perception we have shown that formant modifications should decrease the audio intelligibility of vowels in noise. On the contrary, modification in lip articulation should improve the visual intelligibility of vowels and bilabial consonants. This is not in agreement with previous studies which reported a global increased intelligibility of Lombard speech especially in the audio domain and not a lot in the visual one [1-3]. Thus, more detailed research is needed about the segmental and prosodic contribution to the increased intelligibility of Lombard speech

References

  1. Junqua, J., "The lombard reflex and it role on human listener and automatic speech recognizers", Journal of the Acoustic Society of America, 93(1): 510-524, 1993.
  2. Davis, C., Kim, J., Grauwinkel, K. et al., "Lombard speech: Auditory(A), Visual(V) and AV effects", in Proceedings of Speech prosody, Dresden, Germany2006. (ISCA Archive, http://www.isca-speech.org/archive/sp2006)
  3. Chung, V., Mirante, N., Otten, J. et al., "Audiovisual processing of Lombard speech", in Proceedings of AVSP-2005, 55-56, 2005. (ISCA Archive, http://www.isca-speech.org/archive/avsp05)

Full Paper

Bibliographic reference.  Garnier, Maëva (2008): "May speech modifications in noise contribute to enhance audio-visible cues to segment perception?", In AVSP-2008, 95-100.