EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Pronunciation Modeling and Lexical Adaptation in Midsize Vocabulary ASR

Louis F. M. ten Bosch, Nick Cremelie

Lernout & Hauspie Speech Products N.V., Belgium

A computational-phonological method is presented to automatically adapt the phone transcriptions in a lexicon to improve ASR performance in a number of mid-size recognition tasks. The lexical adaptation approach is based on supervised phoneme loops using cd-HMM segments to find alternatives for the transcriptions, and can be considered as a counterpart of the K-means algorithm but on symbolic level. The word error rate in a limited task (digit string recognition) with dialect speakers is shown to drop by 20-25 percent relative, starting from non-dialect digit transcriptions. Since the method is computationally involving, it is only feasible for relatively small tasks.

Full Paper

Bibliographic reference.  Bosch, Louis F. M. ten / Cremelie, Nick (2001): "Pronunciation modeling and lexical adaptation in midsize vocabulary ASR", In EUROSPEECH-2001, 1421-1424.