5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Modeling Linguistic Segment and Turn Boundaries for N-Best Rescoring of Spontaneous Speech

Andreas Stolcke

Speech Technology and Research Laboratory, SRI International, Menlo Park, CA, USA

Language modeling, especially for spontaneous speech, often suffers from a mismatch of utterance segmentations between training and test conditions. In particular, training often uses linguistically based segments, whereas testing occurs on acoustically determined segments, and this mismatch degrades performance. We present an N-best rescoring algorithm that removes the effect of segmentation mismatch. Furthermore, we show that explicit language modeling of hidden linguistic segment boundaries is improved by including turn-boundary events in the model.

