4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Learning Pronunciation Dictionary from Speech Data

Christian-Michael Westendorf, Jens Jelitto

Dresden University of Technology, Institute for Technical Acoustics, Dresden, Germany

In this paper an algorithm and first results from our investigations in automatically learning pronunciation variations from speech data are presented. Pronunciation dictionaries establish an important feature in state-of-the-art speech recognition systems. In most systems only simple dictionaries containing the canonical pronunciation forms are implemented. However, for a good recognition performance more sophisticated dictionaries including pronunciation variations are essential. The generation of such dictionaries by hand is an extremely time consuming task, and the introduction of errors and inconsistencies is probable. We show an approach for automatically generating suitable pronunciation dictionaries from the speech data base itself, as they are desirable not only for speech recognition tasks but also for speech technology and phonologic research in general. The only knowledge sources besides the data base are the (unlabeled) signals and their transliterations on word level. First experiments yielding promising results have been performed with the software system DataLab [6], which integrates the recognition system of the TU Dresden.

Full Paper

Bibliographic reference.  Westendorf, Christian-Michael / Jelitto, Jens (1996): "Learning pronunciation dictionary from speech data", In ICSLP-1996, 1045-1048.