ISCA Archive Interspeech 2009

Unsupervised training scheme with non-stereo data for empirical feature vector compensation

L. Buera, Antonio Miguel, Alfonso Ortega, Eduardo Lleida, Richard M. Stern

In this paper, a novel training scheme based on unsupervised, non-stereo data is presented for Multi-Environment Model-based LInear Normalization (MEMLIN) and for MEMLIN with a cross-probability model based on GMMs (MEMLIN-CPM). Both are data-driven feature vector normalization techniques that have proved very effective in dynamic noisy acoustic environments. However, techniques of this kind usually require stereo data in a prior training phase, which can be an important limitation in real situations. To overcome this drawback, we present an approach based on the maximum likelihood (ML) criterion and Vector Taylor Series (VTS). Experiments have been carried out with the Spanish SpeechDat Car database, reaching consistent improvements: 48.7% and 61.9% when the novel training process is applied over MEMLIN and MEMLIN-CPM, respectively.


doi: 10.21437/Interspeech.2009-359

Cite as: Buera, L., Miguel, A., Ortega, A., Lleida, E., Stern, R.M. (2009) Unsupervised training scheme with non-stereo data for empirical feature vector compensation. Proc. Interspeech 2009, 1247-1250, doi: 10.21437/Interspeech.2009-359

@inproceedings{buera09_interspeech,
  author={L. Buera and Antonio Miguel and Alfonso Ortega and Eduardo Lleida and Richard M. Stern},
  title={{Unsupervised training scheme with non-stereo data for empirical feature vector compensation}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1247--1250},
  doi={10.21437/Interspeech.2009-359}
}