INTERSPEECH 2006 - ICSLP
In this paper, we describe a method to detect syllabic nuclei in continuous speech. It employs two basic and robust acoustic features, periodicity and energy, to detect syllable landmarks. This method is evaluated on TIMIT, noise additive TIMIT and NTIMIT datasets with typical total error rates of around 30% in all the datasets, except for extremely adverse 0dB signal-noise-ratio environments, while HMMbased systems degrade rigorously. Based on the landmarks, a vowel classifier is further constructed and achieves the same performance as HMM-based systems.
Bibliographic reference. Xie, Zhimin / Niyogi, Partha (2006): "Robust acoustic-based syllable detection", In INTERSPEECH-2006, paper 1327-Wed1BuP.6.