ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Thai monophthong recognition using continuous density hidden Markov model and LPC cepstral coefficients

Ekkarit Maneenoi, Somchai Jitapunkul, Visarut Ahkuputra, Umavasee Thathong, Boonchai Thampanitchawong, Sudaporn Luksaneeyanawin

This paper presents Thai monophthongs recognition. The monophthongs were qualitatively recognized by the 3-state left-to-right continuous density hidden Markov model. The LPC cepstral coefficients were used as feature which represented specch signal. The temporal cepstral derivative was additionally utilized in order to compare efficiency of the additional feature with that of the single LPC cepstral coefficients. The number of coefficient orders was varied in order to determine an appropriate order. Thai single, double, and triple polysyllabic words were used in this research. The 18 monophthongs from the polysyllabic words were qualitatively recognized as 9 different vowels. The highest recognition rate of the single feature obtained from 18-order LPC cepstral coefficient is 86.983 percent, while the recognition rate of the 16-order LPC cepstral coefficient accompanied by temporal derivative is 94.580 percent. The misclassification is examined and concluded that this resulted from excessively overlapped distributions of vowels in low and in back vowel group respectively.


Cite as: Maneenoi, E., Jitapunkul, S., Ahkuputra, V., Thathong, U., Thampanitchawong, B., Luksaneeyanawin, S. (2000) Thai monophthong recognition using continuous density hidden Markov model and LPC cepstral coefficients. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 620-623

@inproceedings{maneenoi00_icslp,
  author={Ekkarit Maneenoi and Somchai Jitapunkul and Visarut Ahkuputra and Umavasee Thathong and Boonchai Thampanitchawong and Sudaporn Luksaneeyanawin},
  title={{Thai monophthong recognition using continuous density hidden Markov model and LPC cepstral coefficients}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 4, 620-623}
}