5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Efficient Estimation of Perceptual Features for Speech Recognition

Zhihong Hu, Etienne Barnard

Center for Spoken Language Understanding, Oregon Graduate Institute of Science and Technology, Portland, OR, USA

A number of studies have shown that a pair of perceptual effective formants can be defined to capture most of the phonetic information present in vowels. Various methods of computing the effective formant values were proposed. However, many of them depend on the accuracy of conventional formant estimation. In this work, we study methods of automatically estimating perceptual effective formants without estimating the actual formant values and compare the results with the perceptually measured effective formant values. The preliminary results show that the method is effective in estimating the perceptual effective formants. Classification experiments using perceptual effective formants as explicit features do not demonstrate any advantages. However, using the perceptual effective second formant value as input to our formant estimation algorithm can help to correct up to 44% of the formant tracking errors.

Full Paper

Bibliographic reference.  Hu, Zhihong / Barnard, Etienne (1997): "Efficient estimation of perceptual features for speech recognition", In EUROSPEECH-1997, 493-496.