7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

Recognition of Noisy Speech Using Normalized Moments

Jingdong Chen, Yiteng (Arden) Huang, Qi Li, Frank K. Soong

Lucent Technologies, USA

Spectral subband centroid, which is essentially the first-order normalized moment, has been proposed for speech recognition and its robustness to additive noise has been demonstrated before. In this paper, we extend this concept to the use of normalized spectral subband moments (NSSM) for robust speech recognition. We show that normalized moments, if properly selected, yield comparable recognition performance as the cepstral coefficients in clean speech, while deliver a better performance than the cepstra in noisy environments. We also propose a procedure to construct the dynamic moments that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.


Full Paper

Bibliographic reference.  Chen, Jingdong / Huang, Yiteng (Arden) / Li, Qi / Soong, Frank K. (2002): "Recognition of noisy speech using normalized moments", In ICSLP-2002, 2441-2444.