Sixth International Conference on Spoken Language Processing
October 16-20, 2000
Modeling Word Durations
Venkata Ramana Rao Gadde
Speech Technology and Research Laboratory,
SRI International, Menlo Park, CA, USA
We describe a new method of modeling duration at word
level. These duration models are easily trained from the
acoustic training data and can be used to rescore N-best
lists of recognition hypotheses. The models capture
some of the well known durational effects such as
prepausal lengthening. They incorporate a simple back
off mechanism to handle unseen words during rescoring.
Experiments with various large vocabulary
conversational speech recognition (LVCSR) evaluation
sets showed consistent improvements of 0.7-1.0% in
word error rate (WER).
Gadde, Venkata Ramana Rao (2000):
"Modeling word durations",
In ICSLP-2000, vol.1, 601-604.