ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Replacing uncertainty decoding with subband re-estimation for large vocabulary speech recognition in noise

Jianhua Lu, Ji Ming, Roger Woods

In this paper, we propose a novel approach for parameterized model compensation for large-vocabulary speech recognition in noisy environments. The new compensation algorithm, termed CMLLR-SUBREST, combines the model-based uncertainty decoding (UD) with subspace distribution clustering hidden Markov modeling (SDCHMM), so that the UD-type compensation can be realized by re-estimating the models based on small amount of adaptation data. This avoids the estimation of the covariance biases, which is required in model-based UD and usually needs a numerical approach. The Aurora 4 corpus is used in the experiments. We have achieved 16.9% relative WER (word error rate) reduction over our previous missing-feature (MF) based decoding and 16.1% over the combination of Constrained MLLR compensation and MF decoding. The number of model parameters is reduced by two orders of magnitude.


doi: 10.21437/Interspeech.2009-370

Cite as: Lu, J., Ming, J., Woods, R. (2009) Replacing uncertainty decoding with subband re-estimation for large vocabulary speech recognition in noise. Proc. Interspeech 2009, 2407-2410, doi: 10.21437/Interspeech.2009-370

@inproceedings{lu09c_interspeech,
  author={Jianhua Lu and Ji Ming and Roger Woods},
  title={{Replacing uncertainty decoding with subband re-estimation for large vocabulary speech recognition in noise}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2407--2410},
  doi={10.21437/Interspeech.2009-370}
}