In this paper, we present our latest investigations on pronunciation modeling and its impact on ASR. We propose completely automatic methods to detect, remove, and substitute inconsistent or flawed entries in pronunciation dictionaries. The experiments were conducted on different tasks, namely (1) word-pronunciation pairs from the Czech, English, French, German, Polish, and Spanish Wiktionary , a multilingual wiki-based open content dictionary, (2) our GlobalPhone Hausa pronunciation dictionary , and (3) pronunciations to complement our Mandarin-English SEAME code-switch dictionary . In the final results, we fairly observed on average an improvement of 2.0% relative in terms of word error rate and even 27.3% for the case of English Wiktionary word-pronunciation pairs.
Index Terms: pronunciation dictionaries, automatic error recovery, multilingual speech recognition
Bibliographic reference. Schlippe, Tim / Ochs, Sebastian / Vu, Ngoc Thang / Schultz, Tanja (2012): "Automatic error recovery for pronunciation dictionaries", In INTERSPEECH-2012, 2298-2301.