ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Modeling trajectories in the HMM framework

Rukmini Iyer, Owen Kimball, Herbert Gish

Most state-of-the-art statistical speech recognition systems use hidden Markov models (HMM) for modeling the speech signal. However, limited by the assumption of conditional independence of observations given the state se-quence, current HMM's poorly model the trajectory con-straints in speech. In [1], we introduced the parallel path HMM, where each phonetic unit is represented by a parallel collection of HMM's that model the phone trajectory variability. The trajectory constraint is imposed by disallowing transitions across parallel paths. In this paper,we investigate improvements to two critical components ofthis new framework: (i) initializing the sets of trajectoriesper phone that will form the basis of the parallel collection of HMM's, and (ii) evaluating alternative parameter shar-ing strategies related to distributing the number of model parameters. Recognition results on Switchboard, a large vocabulary conversational speech recognition task, demonstrate 0.7-1.0% absolute performance improvements with the parallel path HMM in the N-best rescoring paradigm.

doi: 10.21437/Eurospeech.1999-123

Cite as: Iyer, R., Kimball, O., Gish, H. (1999) Modeling trajectories in the HMM framework. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 479-482, doi: 10.21437/Eurospeech.1999-123

  author={Rukmini Iyer and Owen Kimball and Herbert Gish},
  title={{Modeling trajectories in the HMM framework}},
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},