EUROSPEECH 2003 - INTERSPEECH 2003
In the development of a syllable-centric ASR system, segmentation of the acoustic signal into syllabic units is an important stage. This paper presents a minimum phase group delay based approach to segment spontaneous speech into syllable-like units. Here, three different minimum phase signals are derived from the short term energy functions of three sub-bands of speech signals, as if it were a magnitude spectrum. The experiments are carried out on Switchboard and OGI-MLTS corpus and the error in segmentation is found to be utmost 40msec for 85% of the syllable segments.
Bibliographic reference. Nagarajan, T. / Murthy, Hema A. / Hegde, Rajesh M. (2003): "Segmentation of speech into syllable-like units", In EUROSPEECH-2003, 2893-2896.