Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

Modeling Word Durations

Venkata Ramana Rao Gadde

Speech Technology and Research Laboratory, SRI International, Menlo Park, CA, USA

We describe a new method of modeling duration at word level. These duration models are easily trained from the acoustic training data and can be used to rescore N-best lists of recognition hypotheses. The models capture some of the well known durational effects such as prepausal lengthening. They incorporate a simple back off mechanism to handle unseen words during rescoring. Experiments with various large vocabulary conversational speech recognition (LVCSR) evaluation sets showed consistent improvements of 0.7-1.0% in word error rate (WER).


Full Paper

Bibliographic reference.  Gadde, Venkata Ramana Rao (2000): "Modeling word durations", In ICSLP-2000, vol.1, 601-604.