8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Comparison of HMM and DTW Methods in Automatic Recognition of Pathological Phoneme Pronunciation

Robert Wielgat (1), Tomasz P. Zieliński (2), Paweł Świętojański (1), Piotr Żołądź (1), Daniel Król (1), Tomasz Woźniak (3), Stanisław Grabias (3)

(1) Higher State Vocational School in Tarnów, Poland
(2) AGH University of Science & Technology, Poland
(3) Maria Curie-Skłodowska University, Poland

In the paper recently proposed Human Factor Cepstral Coefficients (HFCC) are used to automatic recognition of pathological phoneme pronunciation in speech of impaired children and efficiency of this approach is compared to application of the standard Mel-Frequency Cepstral Coefficients (MFCC) as a feature vector. Both dynamic time warping (DTW), working on whole words or embedded phoneme patterns, and hidden Markov models (HMM) are used as classifiers in the presented research. Obtained results demonstrate superiority of combining HFCC features and modified phoneme-based DTW classifier.

Full Paper

Bibliographic reference.  Wielgat, Robert / Zieliński, Tomasz P. / Świętojański, Paweł / Żołądź, Piotr / Król, Daniel / Woźniak, Tomasz / Grabias, Stanisław (2007): "Comparison of HMM and DTW methods in automatic recognition of pathological phoneme pronunciation", In INTERSPEECH-2007, 1705-1708.