ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition

April 13-16, 2003
Tokyo Institute of Technology, Tokyo, Japan

Using Continuous Space Language Models for Conversational Speech Recognition

Holger Schwenk, Jean-Luc Gauvain

LIMSI-CNRS, Orsay, France

Language modeling for conversational speech suffers from the limited amount of available adequate training data. This paper describes a new approach that performs the estimation of the language model probabilities in a continuous space, allowing by these means smooth interpolation of unobserved n-grams. This continuous space language model is used during the last decoding pass of a state-of-the-art conversational telephone speech recognizer to rescore word lattices. For this type of speech data, it achieves consistent word error reductions of more than 0.4% compared to a carefully tuned backoff n-gram language model.


Full Paper

Bibliographic reference.  Schwenk, Holger / Gauvain, Jean-Luc (2003): "Using continuous space language models for conversational speech recognition", in SSPR-2003, paper TMO5.