ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

Applying word duration constraints by using unrolled HMMs

Ning Ma, Jon Barker, Phil Green

Conventional HMMs have weak duration constraints. In noisy conditions, the mismatch between corrupted speech signals and models trained on clean speech may cause the decoder to produce word matches with unrealistic durations. This paper presents a simple way to incorporate word duration constraints by unrolling HMMs to form a lattice where word duration probabilities can be applied directly to state transitions. The expanded HMMs are compatible with conventional Viterbi decoding. Experiments on connected-digit recognition show that when using explicit duration constraints the decoder generates word matches with more reasonable durations, and word error rates are significantly reduced across a broad range of noise conditions.


doi: 10.21437/Interspeech.2007-105

Cite as: Ma, N., Barker, J., Green, P. (2007) Applying word duration constraints by using unrolled HMMs. Proc. Interspeech 2007, 1066-1069, doi: 10.21437/Interspeech.2007-105

@inproceedings{ma07_interspeech,
  author={Ning Ma and Jon Barker and Phil Green},
  title={{Applying word duration constraints by using unrolled HMMs}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={1066--1069},
  doi={10.21437/Interspeech.2007-105}
}