ISCA Archive Eurospeech 2001

Using information retrieval methods for language model adaptation

Langzhou Chen, Jean-Luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda

In this paper we report experiments on language model adaptation using information retrieval methods, drawing upon recent developments in information extraction and topic tracking. One of the problems is extracting reliable topic information from the audio signal in the presence of recognition errors. Work in the information retrieval domain on information extraction and topic tracking suggests a new way to solve this problem. We use information retrieval methods to extract topic information from the word recognizer hypotheses, and this information is then used to automatically select adaptation data from a very large general text corpus. Two adaptive language models, a mixture-based model and a MAP-based model, have been investigated using the adaptation data. Experiments carried out with the LIMSI Mandarin broadcast news transcription system give a relative character error rate reduction of 4.3% with this adaptation method.
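The pipeline described in the abstract (use the recognizer hypothesis as a query, retrieve related text from a general corpus, then combine a model trained on the retrieved text with the general model) can be illustrated with a minimal sketch. The abstract does not give the retrieval or estimation details, so everything below is an assumption made for illustration: TF-IDF with cosine similarity stands in for the retrieval method, add-one smoothed unigram models stand in for the n-gram language models, and the function names, `top_k`, and the interpolation weight `lam` are hypothetical. Only the mixture-style combination is shown; the MAP-based model is not.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build TF-IDF vectors for tokenized documents (assumed weighting scheme)."""
    n = len(docs)
    df = Counter(w for d in docs for w in set(d))
    # Smoothed IDF so that every term keeps a positive weight.
    idf = {w: math.log((n + 1) / (c + 1)) + 1.0 for w, c in df.items()}
    vecs = [{w: tf * idf[w] for w, tf in Counter(d).items()} for d in docs]
    return vecs, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(v * b.get(w, 0.0) for w, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def select_adaptation_data(hypothesis, corpus, top_k=2):
    """Return indices of the top_k corpus documents most similar to the hypothesis."""
    vecs, idf = tfidf_vectors(corpus)
    # Words unseen in the corpus get zero weight in the query.
    query = {w: tf * idf.get(w, 0.0) for w, tf in Counter(hypothesis).items()}
    ranked = sorted(range(len(corpus)),
                    key=lambda i: cosine(query, vecs[i]), reverse=True)
    return ranked[:top_k]


def unigram(docs, vocab):
    """Add-one smoothed unigram model over a fixed vocabulary."""
    counts = Counter(w for d in docs for w in d)
    total = sum(counts[w] for w in vocab) + len(vocab)
    return {w: (counts[w] + 1) / total for w in vocab}


def mixture(p_general, p_adapt, lam=0.5):
    """Linear interpolation of the general and the adaptation model."""
    return {w: (1 - lam) * p_general[w] + lam * p_adapt[w] for w in p_general}


if __name__ == "__main__":
    # Toy stand-in for the general text corpus and a recognizer hypothesis.
    corpus = [
        ["stock", "market", "falls"],
        ["football", "match", "result"],
        ["market", "shares", "rise"],
        ["weather", "rain", "today"],
    ]
    hypothesis = ["stock", "market", "shares"]

    selected = select_adaptation_data(hypothesis, corpus, top_k=2)
    vocab = sorted({w for d in corpus for w in d})
    p_general = unigram(corpus, vocab)
    p_adapt = unigram([corpus[i] for i in selected], vocab)
    p_mix = mixture(p_general, p_adapt, lam=0.5)
    print(selected, p_general["market"], p_mix["market"])
```

In this toy setting the interpolated model shifts probability mass toward words from the retrieved documents, which is the intended effect of topic adaptation. In a real system the interpolation weight would typically be tuned on held-out data.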


doi: 10.21437/Eurospeech.2001-86

Cite as: Chen, L., Gauvain, J.-L., Lamel, L., Adda, G., Adda, M. (2001) Using information retrieval methods for language model adaptation. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 255-258, doi: 10.21437/Eurospeech.2001-86

@inproceedings{chen01b_eurospeech,
  author={Langzhou Chen and Jean-Luc Gauvain and Lori Lamel and Gilles Adda and Martine Adda},
  title={{Using information retrieval methods for language model adaptation}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={255--258},
  doi={10.21437/Eurospeech.2001-86}
}