ISCA Archive ICSLP 1998
ISCA Archive ICSLP 1998

Improved feature decorrelation for HMM-based speech recognition

Kris Demuynck, Jacques Duchateau, Dirk Van Compernolle, Patrick Wambacq

Many HMM-based recognition systems use mixtures of diagonal covariance gaussians to model the observation density functions in the states. These mixtures are however only approximations of the real distributions. One of the approximations is the assumption that the off-diagonal elements of the covariance matrices of the gaussians are close to zero (diagonal covariance). To that end, most recognition systems have some kind of parameter decorrelation near the end of the preprocessing, e.g. the inverse cosine transform used with cepstral transformations. These transforms are however not optimal if it comes to decorrelating features on the gaussian level. This paper presents an optimal solution in a least-square sense to the decorrelation problem. It also demonstrates the link between the recently published maximum likelihood modelling for semi-tied covariance matrices and the presented least-squares optimisation. Evaluation on a large vocabulary recognition task shows a 10% relative improvement.


doi: 10.21437/ICSLP.1998-172

Cite as: Demuynck, K., Duchateau, J., Compernolle, D.V., Wambacq, P. (1998) Improved feature decorrelation for HMM-based speech recognition. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 1081, doi: 10.21437/ICSLP.1998-172

@inproceedings{demuynck98_icslp,
  author={Kris Demuynck and Jacques Duchateau and Dirk Van Compernolle and Patrick Wambacq},
  title={{Improved feature decorrelation for HMM-based speech recognition}},
  year=1998,
  booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)},
  pages={paper 1081},
  doi={10.21437/ICSLP.1998-172}
}