We introduce the problem of learning pronunciations of out-of-vocabulary words from word recognition mistakes made by an automatic speech recognition (ASR) system. This problem is especially relevant in cases where the ASR engine is a black box, meaning that the only acoustic cues about the speech data come from the word recognition outputs. This paper presents an expectation-maximization approach to inferring pronunciations from ASR word recognition hypotheses, which outperforms the pronunciation estimates of a state-of-the-art grapheme-to-phoneme system.
Bibliographic reference. Reddy, Sravana / Gouvêa, Evandro (2011): "Learning from mistakes: expanding pronunciation lexicons using word recognition errors", In INTERSPEECH-2011, 533-536.