EUROSPEECH '91
Second European Conference on Speech Communication and Technology

Genova, Italy
September 24-26, 1991

                 

Comparison of Formant Transition Based Stop Classifiers: Time-Varying and Time-Invariant Signal Models

Krishna S. Nathan

IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY, USA

A feature set that captures the dynamics of formant transitions is utilized to classify the unvoiced stop consonants. The second formant and its slope are used to characterize the transition between the vowel and closure in a VCV environment. The performance of a feature set obtained by means of a time-varying, data-selective model for the signal is compared with that of a standard time-invariant (LPC) and data-selective, time-invariant models. The different feature sets are evaluated on a database consisting of 10 talkers. A two-fold reduction in the error rate is obtained by means of the time-varying model. The performance of three different classifiers is presented. A novel adaptive algorithm, termed Learning Vector Classifier, is compared with standard K-means and LVQ2 classifiers. Speaker independent error rates of 11% are obtained for the 3-way classification task. Further improvements, expected when an expanded time-varying feature set is coupled with information from the burst, could lead to very high quality stop classification.

Full Paper

Bibliographic reference.  Nathan, Krishna S. (1991): "Comparison of formant transition based stop classifiers: time-varying and time-invariant signal models", In EUROSPEECH-1991, 147-150.