Sixth International Conference on Spoken Language Processing (ICSLP 2000)

Beijing, China
October 16-20, 2000

On Enhancing Katz-Smoothing Based Back-Off Language Model

Jian Wu, Fang Zheng

Center of Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing, China

Though the statistical language modeling plays an important role in speech recognition, there are still problems that are difficult to be solved such as the sparseness of training data. Generally, two kinds of smoothing approaches, namely the back-off model and the interpolated model, have been proposed to solve the problem of the impreciseness of language models caused by the sparseness of training data. By expanding the idea of interpolation model to Katz-smoothing based re-estimation of the seen word pairs, a back-off model based modified method is proposed, referred to as the enhanced Katz smoothing with deleted interpolation (EKSWDI). A uniform expression and two simplified versions for this modified model are also given. Experiments on a Chinese pinyin-to-character conversion system and the perplexity measure show that the proposed model has a better performance than the Katz smoothing method does.

Full Paper

Bibliographic reference.  Wu, Jian / Zheng, Fang (2000): "On enhancing katz-smoothing based back-off language model", In ICSLP-2000, vol.1, 198-201.