ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

HMM composition of segmental unit input HMM for noisy speech recognition

Kazumasa Yamamoto, Seiichi Nakagawa

For robust speech recognition in noisy environments, various methods have been studied. In this paper, we apply parallel model combination (PMC) for segmental unit input HMM to recognize corrupted speech in additive noise. Since several successive frames are combined and treated as an input vector in segmental unit input modeling, the increased dimension of vector degrades the precision in estimating covariance matrices. Therefore Karhunen-Loeve expansion or LDA is used to reduce the dimension. Thus the inverse transformation of segmental statistics to cepstral domain is needed and correlations between frames have to be taken into account. We expanded the original PMC to segmental unit input HMM. Experimental results showed PMC for segmental unit input HMM proposed here gives better recognition performance than the original PMC.


doi: 10.21437/Eurospeech.1999-634

Cite as: Yamamoto, K., Nakagawa, S. (1999) HMM composition of segmental unit input HMM for noisy speech recognition. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2865-2868, doi: 10.21437/Eurospeech.1999-634

@inproceedings{yamamoto99b_eurospeech,
  author={Kazumasa Yamamoto and Seiichi Nakagawa},
  title={{HMM composition of segmental unit input HMM for noisy speech recognition}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2865--2868},
  doi={10.21437/Eurospeech.1999-634}
}