INTERSPEECH 2004 - ICSLP
This paper presents a study of tone classification for continuous Thai speech recognition. A modified auto-correlation algorithm was implemented with pitch detection, and the tone classifier utilized 3-layer feed-forward neural network with back-propagation. The best performance configuration of tone features was obtained with semitone scaling and mean-normalization producing a classification accuracy of 72.21%. Also, after considering the effects of final consonants, the average performance of the tone classifier improved to 77.13%. Experimental results showed that the pitch value of a tone with no final consonants has more variation than one with final consonants. Also the classification for tones with voiced final consonant gave better performance than tones with unvoiced final consonants.
Bibliographic reference. Li, Tan / Karnjanadecha, Montri / Khaorapapong, Thanate (2004): "A study of tone classification for continuous Thai speech recognition", In INTERSPEECH-2004, 3033-3036.