12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

Learning from Mistakes: Expanding Pronunciation Lexicons Using Word Recognition Errors

Sravana Reddy (1), Evandro Gouvêa (2)

(1) University of Chicago, USA
(2) Independent Researcher, Germany

We introduce the problem of learning pronunciations of out-of-vocabulary words from word recognition mistakes made by an automatic speech recognition (ASR) system. This question is especially relevant in cases where the ASR engine is a black box, meaning that the only acoustic cues about the speech data come from the word recognition outputs. This paper presents an expectation maximization approach to inferring pronunciations from ASR word recognition hypotheses, which outperforms the pronunciation estimates of a state-of-the-art grapheme-to-phoneme system.

Bibliographic reference.  Reddy, Sravana / Gouvêa, Evandro (2011): "Learning from mistakes: expanding pronunciation lexicons using word recognition errors", In INTERSPEECH-2011, 533-536.