Sixth European Conference on Speech Communication and Technology
We examined variants of MFCC and PLP cepstral parameterisations in the context of large vocabulary continuous speech recognition under different acous-tical environmental conditions: Compared to MFCC, mel-frequency PLP uses a cubic root intensity-to-loudness law, and an LPC analysis is applied to the mel-warped spectrum. In LPC-smoothed MFCC, the only difference to MFCC is the additional LPC smoothing of the warped spectrum. While neither technique was able to significantly outperform the MFCC parameterisation in our setup which includes an LDA feature transformation, feature set combination via DMC at the acoustic likelihood level and via ROVER at the recognized word level delivered small but consistent improvements.
Full Paper (PDF) Gnu-Zipped Postscript
Bibliographic reference. Haeb-Umbach, Reinhold / Loog, Marco (1999): "An investigation of cepstral parameterisations for large vocabulary speech recognition", In EUROSPEECH'99, 1323-1326.