EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Estimating Pronunciation Variations from Acoustic Likelihood Score for HMM Reconstruction

Liu Yi, Pascale Fung

Hong Kong University of Science and Technology, Hong Kong

It is widely acknowledged that pronunciation modeling is an efficient way to improve recognition performance in spontaneous speech. In pronunciation modeling, almost all methods of generating variation probability are based on relative frequency counting from DP alignment. In this paper, we investigate the local model mismatching caused by pronunciation variations and propose to estimate variation probability from acoustic likelihood score. According to estimated probability, we present a method of reconstructing pre-trained HMM models to include alternate pronunciations by sharing optimal mixture components instead of distributions. Experimental results show that using reconstructed HMM set reduces syllable error rate by 2.03% absolutely compared to the baseline system, also the accuracy improvement gained from proposed method is almost double with respect to that from previous DP alignment.

Full Paper

Bibliographic reference.  Yi, Liu / Fung, Pascale (2001): "Estimating pronunciation variations from acoustic likelihood score for HMM reconstruction", In EUROSPEECH-2001, 1425-1428.