Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Phonetic Ignorance is Bliss: Investigating the Effects of Phonetic Information Reduction on ASR Performance

Eric Fosler-Lussier, C. Anton Rytting, Soundararajan Srinivasan

Ohio State University, Columbus, OH, USA

Perception studies have long argued that phonetic confusions are more likely to happen across some phonetic features than other (e.g., place of articulation rather than manner) [1]. Similarly, we and others have noted that pronunciation variation occurs more frequently in unstressed syllables, and in syllable codas. This suggests that a phonetic information structure is at play, where for decoding purposes it is important to get phonetic information accurate in stressed syllables, but less so in unstressed syllables. In this work, we explore the role of phonetic information in clean and noisy speech by reducing the phonetic information available to the recognizer. A surprising result is that replacing some phones with manner classes in the dictionary improves recognition in one noise condition.

Full Paper

Bibliographic reference.  Fosler-Lussier, Eric / Rytting, C. Anton / Srinivasan, Soundararajan (2005): "Phonetic ignorance is bliss: investigating the effects of phonetic information reduction on ASR performance", In INTERSPEECH-2005, 1249-1252.