ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Search and classification based language model adaptation

Qin Shi, Stephen M. Chu, Wen Liu, Hong-Kwang Jeff Kuo, Yi Liu, Yong Qin

Adaptation techniques in language modeling have shown growing potentials in improving speech recognition performance. For topic adaptation, a set of pre-defined topic-specific language models are typically used, and adaptation is achieved through adjusting the interpolation weights. However, mismatch between the test data and the pre-defined models inevitably exists and is left untreated in the static approach. Instead of tuning the parameters in the existing models, this paper describes a method that dynamically extracts relevant documents from training sources according to intermediate decoding hypotheses to build new targeted language models. Different from general search-based document collection, a new and effective ranking method is used here for candidate extraction. The targeted language models are interpolated with the static topic language models and a general language model, and used for lattice rescoring. The proposed adaptation technique is implemented in a state-of-the-art Mandarin broadcast transcription system, and evaluated on the GALE task. We show that static topic adaptation reduces the relative character error rate by 4.9%. It is further shown that the proposed dynamic adaptation technique attains an additional 10.3% reduction in error rate.

doi: 10.21437/Interspeech.2008-261

Cite as: Shi, Q., Chu, S.M., Liu, W., Kuo, H.-K.J., Liu, Y., Qin, Y. (2008) Search and classification based language model adaptation. Proc. Interspeech 2008, 1578-1581, doi: 10.21437/Interspeech.2008-261

  author={Qin Shi and Stephen M. Chu and Wen Liu and Hong-Kwang Jeff Kuo and Yi Liu and Yong Qin},
  title={{Search and classification based language model adaptation}},
  booktitle={Proc. Interspeech 2008},