Non-Linear Speech Processing (NOLISP 03)

May 20-23, 2003
Le Croisic, France

Vowel Classification By Global Dynamic Modeling

Xiaolin Liu, Richard J. Povinelli, Michael T. Johnson

Department of Electrical and Computer Engineering, Marquette University, Milwaukee, WI, USA

This paper presents an approach to vowel classification that analyzes the dynamics of speech production in a reconstructed phase space. The approach can capture nonlinearities that may exist in speech production. Global flow reconstruction is used to generate a quantitative description of the structure and trajectory of vowel attractors in the reconstructed phase space. A distance measure is defined to quantify the dynamic similarity between phoneme attractors. Templates of the dynamics for each vowel class are selected by cluster analysis, and out-of-sample vowel phonemes are classified with a nearest-neighbor classifier. Experiments are conducted on both speaker-dependent and speaker-independent vowel classification tasks using the TIMIT corpus. Preliminary experimental results show that vowel classification by nonlinear dynamics analysis can produce results similar to those of a classifier using Mel frequency cepstral coefficient (MFCC) features.
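The pipeline summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the embedding parameters, the form of the global model, or the distance measure, so everything below is an assumption chosen for illustration. The function names (`delay_embed`, `fit_global_map`, `nearest_template`), the default dimension and lag, the affine one-step map standing in for global flow reconstruction, and the Euclidean distance between fitted coefficients are all hypothetical.

```python
import numpy as np


def delay_embed(signal, dim=3, lag=2):
    """Time-delay embedding of a 1-D signal.

    Row n of the result is [x[n], x[n + lag], ..., x[n + (dim - 1) * lag]].
    The values of dim and lag are illustrative assumptions.
    """
    x = np.asarray(signal, dtype=float)
    n_points = len(x) - (dim - 1) * lag
    if n_points <= 0:
        raise ValueError("signal too short for this dim and lag")
    return np.column_stack([x[i * lag: i * lag + n_points] for i in range(dim)])


def fit_global_map(points):
    """Fit one affine map p[n + 1] ~ [p[n], 1] @ A over the whole trajectory.

    This least-squares fit is a simple stand-in (an assumption) for a
    global model of the flow; the flattened coefficients of A serve as a
    fixed-length descriptor of the dynamics.
    """
    ones = np.ones((len(points) - 1, 1))
    current = np.hstack([points[:-1], ones])  # states at time n, plus bias
    following = points[1:]                    # states at time n + 1
    coeffs, *_ = np.linalg.lstsq(current, following, rcond=None)
    return coeffs.ravel()


def nearest_template(descriptor, templates):
    """Return the label of the closest template.

    templates is a list of (label, descriptor) pairs. Euclidean distance
    between descriptors is assumed here purely for illustration.
    """
    label, _ = min(templates, key=lambda t: np.linalg.norm(descriptor - t[1]))
    return label
```

In this sketch, a segment would be classified by embedding it, fitting the map, and passing the resulting descriptor to `nearest_template` together with one or more stored descriptors per vowel class. Selecting those templates by cluster analysis, as the abstract describes, is not shown.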


Bibliographic reference. Liu, Xiaolin / Povinelli, Richard J. / Johnson, Michael T. (2003): "Vowel classification by global dynamic modeling", in NOLISP-2003, paper 005.