Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Context-Dependent Word Duration Modelling for Robust Speech Recognition

Ning Ma, Phil Green

University of Sheffield, UK

Conventional hidden Markov models (HMMs) impose only weak duration constraints. In noisy conditions this can cause the decoder to produce word matches with unrealistic durations. This paper describes techniques for modelling context-dependent word duration cues and incorporating them directly in a multi-stack decoding algorithm. The proposed model penalises a word hypothesis whose duration is implausible, with the penalty depending on the word's context. Experiments on connected digit recognition show that the new system can significantly improve recognition performance at different noise levels.
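
As an illustration of the general idea only, the sketch below adds a context-dependent duration score to the acoustic log score of a word hypothesis. The abstract does not specify the duration distribution, the set of contexts, or how the score is weighted, so everything here is an assumption: the Gaussian duration model, the context labels, the numbers in `DURATION_STATS`, the floor, and the weight are all hypothetical, and the multi-stack decoder itself is not modelled.

```python
import math

# Hypothetical duration statistics in frames: (word, context) -> (mean, std).
# The context labels and all numbers are invented for illustration.
DURATION_STATS = {
    ("seven", "before_word"): (35.0, 8.0),
    ("seven", "before_pause"): (48.0, 12.0),
}

DURATION_FLOOR = -20.0   # assumed floor so one bad duration cannot dominate
DURATION_WEIGHT = 1.0    # assumed scaling of the duration score


def duration_log_score(word, context, n_frames):
    """Floored log of a Gaussian density over the word's duration in frames."""
    mean, std = DURATION_STATS[(word, context)]
    z = (n_frames - mean) / std
    log_density = -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))
    return max(log_density, DURATION_FLOOR)


def rescore(acoustic_log_score, word, context, n_frames):
    """Combine the acoustic log score with the weighted duration score."""
    return acoustic_log_score + DURATION_WEIGHT * duration_log_score(
        word, context, n_frames
    )


if __name__ == "__main__":
    # A 5-frame "seven" is penalised more heavily than a 35-frame one.
    for frames in (5, 35):
        print(frames, rescore(-100.0, "seven", "before_word", frames))
```

In this sketch a decoder would call `rescore` when a word hypothesis ends, so that hypotheses with unrealistic durations lose rank against competing hypotheses. Using a separate entry per context lets the same duration be scored differently, for example when a word precedes a pause rather than another word.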

Bibliographic reference. Ma, Ning / Green, Phil (2005): "Context-dependent word duration modelling for robust speech recognition", in INTERSPEECH-2005, 2609-2612.