Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Robust Acoustic-Based Syllable Detection

Zhimin Xie, Partha Niyogi

University of Chicago, USA

In this paper, we describe a method to detect syllabic nuclei in continuous speech. It employs two basic and robust acoustic features, periodicity and energy, to detect syllable landmarks. This method is evaluated on TIMIT, noise additive TIMIT and NTIMIT datasets with typical total error rates of around 30% in all the datasets, except for extremely adverse 0dB signal-noise-ratio environments, while HMMbased systems degrade rigorously. Based on the landmarks, a vowel classifier is further constructed and achieves the same performance as HMM-based systems.

Full Paper

Bibliographic reference.  Xie, Zhimin / Niyogi, Partha (2006): "Robust acoustic-based syllable detection", In INTERSPEECH-2006, paper 1327-Wed1BuP.6.