In this paper we investigate measures for the evaluation of pronunciation dictionaries that can be used independently of the type of lexicon, the language, a specific recognizer and how the dictionary was generated. We will describe statistical measures, measures based on information theory and performance measures and give examples how these measures can be practically applied in supervision of data-driven dictionary training, selection of pronunciation variants and evaluation of the consistency of different dictionaries. Although the introduced measures are independent of the type of dictionary, we only report results obtained with a datadriven dictionary generation and do not address measures specific to rule-based approaches.
Cite as: Wolff, M., Eichner, M., Hoffmann, R. (2002) Measuring the quality of pronunciation dictionaries. Proc. ITRW on Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology (PMLA 2002), 117-122
@inproceedings{wolff02_pmla, author={Matthias Wolff and Matthias Eichner and RĂ¼diger Hoffmann}, title={{Measuring the quality of pronunciation dictionaries}}, year=2002, booktitle={Proc. ITRW on Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology (PMLA 2002)}, pages={117--122} }