ISCA Archive Eurospeech 1999

Context scope selection in multi-Span statistical language modeling

Jerome R. Bellegarda

A multi-span framework was recently proposed to integrate the various constraints, both local and global, that are present in the language. In this approach, local constraints are captured via n-gram language modeling, while global constraints are taken into account through the use of latent semantic analysis. The complementarity between these two paradigms translates into improved modeling performance, as measured by both perplexity and word error rate reduction. This performance improvement is sensitive to the context scope, i.e., the effective length of the document history used in latent semantic analysis during recognition. Context scope selection via exponential forgetting is proposed to discount older utterances as necessary. Experiments on a subset of the Wall Street Journal task led to a reduction in average word error rate of up to 22.5%.
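
The abstract does not give the forgetting formula, so the following is only a minimal sketch of the general idea of exponential forgetting applied to a word-count history. The function name `decayed_counts` and the parameter `lam` are hypothetical, and the per-word decay schedule is an assumption rather than the paper's formulation.

```python
from collections import defaultdict


def decayed_counts(words, lam):
    """Return exponentially discounted word counts for a history.

    Assumption (not from the paper): every existing count is multiplied
    by lam each time a new word arrives, so a word seen k steps ago
    contributes lam**k to its count.
    """
    if not 0.0 < lam <= 1.0:
        raise ValueError("lam must be in (0, 1]")
    counts = defaultdict(float)
    for word in words:
        # Discount everything already in the history.
        for key in counts:
            counts[key] *= lam
        # The newest word enters with full weight.
        counts[word] += 1.0
    return dict(counts)


# lam = 1.0 keeps the whole history; smaller values shorten the
# effective context scope.
full_scope = decayed_counts(["stocks", "fell", "stocks"], 1.0)
short_scope = decayed_counts(["stocks", "fell", "stocks"], 0.5)
```

In this sketch, `lam = 1.0` corresponds to an unbounded history, while values below 1 give older words progressively less weight in the vector that would be passed to latent semantic analysis.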


doi: 10.21437/Eurospeech.1999-478

Cite as: Bellegarda, J.R. (1999) Context scope selection in multi-Span statistical language modeling. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2163-2166, doi: 10.21437/Eurospeech.1999-478

@inproceedings{bellegarda99_eurospeech,
  author={Jerome R. Bellegarda},
  title={{Context scope selection in multi-Span statistical language modeling}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2163--2166},
  doi={10.21437/Eurospeech.1999-478}
}