Computing spectral features remains a difficult problem for accurate machine recognition of vowels, especially in the presence of noise or for otherwise degraded acoustic signals. In this work, a new peak envelope method for vowel classification is developed, based on a missing frequency components model of speech recognition. According to this model, vowel recognition depends only on the location of spectral peaks. Moreover, the smoothing and interpolation of the sampled spectra performed in the cepstral analysis method commonly used in automatic speech recognition result in a loss of valuable information. The new method for feature extraction presented in this paper is based on minimum mean square error curve fitting of cosine-like basis vectors to all peaks in the speech spectrum. A mathematical model is presented for smoothly tracking spectral envelopes using only spectral peak information and ignoring other parts of the spectrum. A software algorithm for the model was developed and tested for various speaker types using a neural network classifier. Vowel classification experiments were conducted based on the features derived from the spectral peaks. The classification rates of the peak method under various signal-to-noise ratios were also evaluated. The basic conclusion is that the new features perform the same as cepstral features for clean speech, but have advantages when the signal is degraded by noise.
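The core idea of the abstract, a least-squares fit of cosine basis vectors to the spectral peaks only, can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the exact "cosine-like" basis, the peak-picking rule, or the number of basis terms, so the plain cosines `cos(k*pi*f)`, the simple local-maximum peak picker, the log-magnitude input, and the names `find_peaks`, `cosine_basis`, `fit_peak_envelope`, and `n_terms` are all assumptions made for illustration.

```python
import numpy as np


def find_peaks(log_spec):
    """Return indices of strict local maxima (interior bins only).

    Assumption: a simple neighbor comparison stands in for whatever
    peak-picking rule the paper actually uses.
    """
    s = np.asarray(log_spec, dtype=float)
    is_peak = (s[1:-1] > s[:-2]) & (s[1:-1] > s[2:])
    return np.flatnonzero(is_peak) + 1


def cosine_basis(freqs, n_terms):
    """Matrix whose columns are cos(k*pi*f), k = 0..n_terms-1.

    Assumption: plain cosines over frequency normalized to [0, 1]
    stand in for the paper's "cosine-like" basis vectors.
    """
    f = np.asarray(freqs, dtype=float)[:, None]
    k = np.arange(n_terms)[None, :]
    return np.cos(np.pi * k * f)


def fit_peak_envelope(log_spec, n_terms=6):
    """Least-squares fit of the cosine basis to the spectral peaks only.

    Returns (coefficients, envelope on the full grid, peak indices).
    All non-peak bins are ignored during the fit.
    """
    s = np.asarray(log_spec, dtype=float)
    grid = np.linspace(0.0, 1.0, len(s))  # normalized frequency axis
    peaks = find_peaks(s)
    if len(peaks) < n_terms:
        raise ValueError("need at least n_terms peaks for a determined fit")
    # Minimize the squared error at the peak locations only.
    design = cosine_basis(grid[peaks], n_terms)
    coeffs, *_ = np.linalg.lstsq(design, s[peaks], rcond=None)
    # Evaluate the fitted envelope on every frequency bin.
    envelope = cosine_basis(grid, n_terms) @ coeffs
    return coeffs, envelope, peaks
```

In this sketch the fitted coefficients would play the role that cepstral coefficients play in conventional analysis, that is, a low-dimensional feature vector for a classifier. The difference is that the valleys between peaks, where noise tends to dominate, do not influence the fit.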
Cite as: Venugopal, J., Zahorian, S.A., Karnjanadecha, M. (2000) Minimum mean square error spectral peak envelope estimation for automatic vowel classification. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 1081-1084, doi: 10.21437/ICSLP.2000-461
@inproceedings{venugopal00_icslp,
  author={Jaishree Venugopal and Stephen A. Zahorian and Montri Karnjanadecha},
  title={{Minimum mean square error spectral peak envelope estimation for automatic vowel classification}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 1081-1084},
  doi={10.21437/ICSLP.2000-461}
}