ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Invariant-integration method for robust feature extraction in speaker-independent speech recognition

Florian Müller, Alfred Mertins

The vocal tract length (VTL) is one of the variabilities that speaker-independent automatic speech recognition (ASR) systems encounter. Standard methods to compensate for the effects of different VTLs within the processing stages of the ASR systems often have a high computational effort. By using an appropriate warping scheme for the frequency centers of the time-frequency analysis, a change in VTL can be approximately described by a translation in the subband-index space. We present a new type of features that is based on the principle of invariant integration, and an according feature selection method is described. ASR experiments show the increased robustness of the proposed features in comparison to standard MFCCs.


doi: 10.21437/Interspeech.2009-753

Cite as: Müller, F., Mertins, A. (2009) Invariant-integration method for robust feature extraction in speaker-independent speech recognition. Proc. Interspeech 2009, 2975-2978, doi: 10.21437/Interspeech.2009-753

@inproceedings{muller09b_interspeech,
  author={Florian Müller and Alfred Mertins},
  title={{Invariant-integration method for robust feature extraction in speaker-independent speech recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2975--2978},
  doi={10.21437/Interspeech.2009-753}
}