ISCA Archive ICSLP 2000

On the importance of components of the MFCC in speech and speaker recognition

Bin Zhen, Xihong Wu, Zhimin Liu, Huisheng Chi

In this paper, we analyze the relative importance of the individual MFCC components for both speech recognition and speaker recognition, using a DTW recognizer in various noise environments. For English digits and under the Euclidean distance measure, the experimental results show that cepstral components C2 to C16 contain the most useful speaker information, while C0 and C1 are usually harmful to speaker recognition. Cepstral terms C1 to C12 are found to contain the most useful speech information. In both tasks, additive noise decreases the relative importance of the low MFCC terms faster than that of the middle and high MFCC terms, and the size of the decrease depends on the speech SNR. Channel distortion likewise degrades the low terms more than the middle and high MFCC terms in both tasks.
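
The kind of comparison described in the abstract can be illustrated with a minimal sketch: restrict each utterance to a range of cepstral coefficients, then score a test utterance against a template with DTW using a Euclidean frame distance. This is not the authors' implementation. The function names `select_components` and `dtw_distance`, the layout in which column k of the feature matrix holds Ck, the simple symmetric step pattern, and the random placeholder features are all assumptions made for illustration.

```python
import numpy as np


def select_components(mfcc, first, last):
    """Keep coefficients C<first>..C<last> inclusive.

    Assumes `mfcc` has shape (frames, coeffs) and column k holds Ck.
    """
    return mfcc[:, first:last + 1]


def dtw_distance(a, b):
    """Accumulated DTW cost between two (frames, coeffs) sequences.

    Local cost is the Euclidean distance between frames; the allowed
    steps (vertical, horizontal, diagonal) are an assumed step pattern.
    """
    n, m = len(a), len(b)
    # Pairwise Euclidean distances between every frame of a and of b.
    local = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best_prev = min(acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1])
            acc[i, j] = local[i - 1, j - 1] + best_prev
    return acc[n, m]


# Placeholder features standing in for real MFCC matrices (C0..C16).
rng = np.random.default_rng(0)
template = rng.normal(size=(40, 17))
test_utt = rng.normal(size=(35, 17))

# Score with C2..C16 only, the range the abstract reports as most
# useful for speaker recognition.
score = dtw_distance(select_components(template, 2, 16),
                     select_components(test_utt, 2, 16))
```

Changing the `first` and `last` arguments and comparing recognition accuracy over a labelled set is one way to probe which coefficient ranges matter; the paper's actual experimental protocol is not given in the abstract.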


Cite as: Zhen, B., Wu, X., Liu, Z., Chi, H. (2000) On the importance of components of the MFCC in speech and speaker recognition. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 487-490

@inproceedings{zhen00_icslp,
  author={Bin Zhen and Xihong Wu and Zhimin Liu and Huisheng Chi},
  title={{On the importance of components of the MFCC in speech and speaker recognition}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 487-490}
}