ISCA Archive ICSLP 1994
ISCA Archive ICSLP 1994

Minimum error rate training of inter-word context dependent acoustic model units in speech recognition

W. Chou, C.-E. Lee, Biing-Hwang Juang

In this paper, we study the issues related to string level acoustic modeling in continuous speech recognition. A new approach based on the minimum string error rate criterion is proposed to the training of inter-word context dependent acoustic model units. Under the proposed approach, the inter-word context dependent acoustic model units are modeled at the global string level by directly applying the minimum string error rate based discriminative analysis to string level acoustic model matching. Experimental results indicate that a significant error rate reduction can be achieved through the proposed approach. Based on the proposed approach, the best performance obtained by a gender-independent model on the TI connected digit corpus is 0.24% word error rate and 0.72% string error rate.


Cite as: Chou, W., Lee, C.-E., Juang, B.-H. (1994) Minimum error rate training of inter-word context dependent acoustic model units in speech recognition. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 439-442

@inproceedings{chou94_icslp,
  author={W. Chou and C.-E. Lee and Biing-Hwang Juang},
  title={{Minimum error rate training of inter-word context dependent acoustic model units in speech recognition}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={439--442}
}