ISCA Archive Interspeech 2007

Regularized feature-based maximum likelihood linear regression for speech recognition

Mohamed Kamal Omar

In many automatic speech recognition (ASR) applications, maximum likelihood linear regression (MLLR) and feature-based maximum likelihood linear regression (FMLLR) are used for speaker adaptation. This paper investigates a possible generalization of FMLLR that addresses the degradation in ASR performance caused by small, possibly time-varying perturbations of the training and testing data. We formulate the problem as a regularized maximum likelihood linear regression problem. Based on this formulation, we describe a computationally efficient algorithm for estimating the linear regression parameters that maximize the sum of the log likelihood and the negative of a measure of the sensitivity of the estimated likelihood to these perturbations. This approach makes no assumptions about the noise model during training or testing. We present several large vocabulary speech recognition experiments that show a significant improvement in recognition accuracy over the speaker-adapted baseline models.
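
The shape of the objective described above (log likelihood minus a sensitivity penalty) can be illustrated with a minimal sketch. The abstract does not define the sensitivity measure or the acoustic model, so everything specific here is an assumption made for illustration: a single diagonal-covariance Gaussian stands in for the acoustic model, sensitivity is taken to be the mean squared norm of the gradient of the log likelihood with respect to the input features, and the names `fmllr_log_likelihood`, `sensitivity`, `regularized_objective`, and the weight `lam` are hypothetical. It is not the paper's algorithm, only one possible instance of the idea.

```python
import numpy as np


def fmllr_log_likelihood(A, b, x, mu, var):
    """Per-frame log likelihood of features x (T, d) after the affine
    transform y = A x + b, under a diagonal Gaussian N(mu, var).
    The log|det A| term is the Jacobian of the feature transform."""
    y = x @ A.T + b
    _, logabsdet = np.linalg.slogdet(A)
    diff = y - mu
    gauss = -0.5 * np.sum(diff ** 2 / var + np.log(2.0 * np.pi * var), axis=1)
    return gauss + logabsdet


def sensitivity(A, b, x, mu, var):
    """ASSUMED sensitivity measure: mean squared norm of the gradient of
    the log likelihood with respect to the input features x."""
    y = x @ A.T + b
    grad_y = -(y - mu) / var      # gradient w.r.t. transformed features
    grad_x = grad_y @ A           # chain rule: A^T grad_y, in row form
    return np.mean(np.sum(grad_x ** 2, axis=1))


def regularized_objective(A, b, x, mu, var, lam=0.1):
    """Mean log likelihood minus lam times the assumed sensitivity penalty.
    lam is a hypothetical regularization weight."""
    mean_ll = np.mean(fmllr_log_likelihood(A, b, x, mu, var))
    return mean_ll - lam * sensitivity(A, b, x, mu, var)
```

Setting `lam=0` reduces the sketch to the plain FMLLR likelihood criterion for this toy model; a positive `lam` favors transforms under which small changes in the input features change the likelihood less.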


doi: 10.21437/Interspeech.2007-125

Cite as: Omar, M.K. (2007) Regularized feature-based maximum likelihood linear regression for speech recognition. Proc. Interspeech 2007, 1561-1564, doi: 10.21437/Interspeech.2007-125

@inproceedings{omar07_interspeech,
  author={Mohamed Kamal Omar},
  title={{Regularized feature-based maximum likelihood linear regression for speech recognition}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={1561--1564},
  doi={10.21437/Interspeech.2007-125}
}