Sixth European Conference on Speech Communication and Technology
The paper discusses the use, in a hybrid recognizer, of gravity centers (gc) in spectral subbands as features to be used in addition to Mel Scaled Cepstral Coefficients (MFCC) and their time derivatives. Results on noisy telephone speech show that gc computed after the nonlinear processing of an ear model increase the word accuracy from 72.63% to 78.13% .
Full Paper (PDF)
Bibliographic reference. Albesano, D. / Mori, R. De / Gemello, R. / Mana, F. (1999): "A study on the effect of adding new dimensions to trajectories in the acoustic space", In EUROSPEECH'99, 1503-1506.