ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

Maximum-likelihood updates of HMM duration parameters for discriminative continuous speech recognition

Rathinavelu Chengalvarayan

Previous studies showed that a significantly enhanced recognition performance can be achieved by incorporating information about HMM duration along with the cepstral parameters. The reestimation formula for the duration parameters have been derived in the past using fixed segmentation during K-means training and the duration statistics are always fixed throughout the additional minimum string error (MSE) training process. In this study, we update the duration parameters along with other model parameters during discriminative training iterations. The convergence property of the training property based on the MSE approach is investigated, and experimental results on wireline connected digit recognition task demonstrated a 6% word error rate reduction by using the newly trained duration model parameters as compared to fixed duartion parameters during MSE training.


doi: 10.21437/ICSLP.1998-641

Cite as: Chengalvarayan, R. (1998) Maximum-likelihood updates of HMM duration parameters for discriminative continuous speech recognition. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0021, doi: 10.21437/ICSLP.1998-641

@inproceedings{chengalvarayan98b_icslp,
  author={Rathinavelu Chengalvarayan},
  title={{Maximum-likelihood updates of HMM duration parameters for discriminative continuous speech recognition}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 0021},
  doi={10.21437/ICSLP.1998-641}
}