Sixth European Conference on Speech Communication and Technology
Recent developments suggest that using syllables as the basic unit in a speech recognition system could be very useful. Since syllable boundaries are more precise and better defined than phoneme boundaries, there is considerable scope for applying them in continuous speech recognition. In this work we developed several methods of syllable segmentation in continuous speech, all based on perceptually oriented feature extraction techniques. The features were post-processed either by simple threshold mechanisms or by an artificial neural network based classifier in order to estimate the syllable boundaries. The systems were trained and evaluated on a Portuguese continuous-speech database. The results show that large context input windows (260 ms) are the most appropriate, detecting 93% of onsets with an insertion rate of only 15%.
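As a rough illustration of the threshold-based variant and of how detection and insertion rates can be scored, the following minimal Python sketch may help. It is not the authors' method: the abstract does not specify the features, thresholds, or scoring rules, so the energy-like `envelope`, the `min_gap` and `tolerance` parameters, the greedy matching, and the definition of insertion rate (unmatched detections divided by the number of reference onsets) are all assumptions made for this example.

```python
def detect_onsets(envelope, threshold, min_gap=5):
    """Return frame indices where the envelope rises through `threshold`.

    `envelope` is a hypothetical per-frame feature (e.g. smoothed energy);
    `min_gap` is an assumed minimum spacing, in frames, between onsets.
    """
    onsets = []
    for i in range(1, len(envelope)):
        rising = envelope[i - 1] < threshold <= envelope[i]
        far_enough = not onsets or i - onsets[-1] >= min_gap
        if rising and far_enough:
            onsets.append(i)
    return onsets


def score_onsets(reference, detected, tolerance=2):
    """Greedily match detected onsets to reference onsets, one-to-one.

    Returns (detection_rate, insertion_rate). Both are normalised by the
    number of reference onsets, which is an assumption of this sketch.
    """
    if not reference:
        raise ValueError("reference onset list must not be empty")
    unmatched = list(detected)
    hits = 0
    for ref in reference:
        candidates = [d for d in unmatched if abs(d - ref) <= tolerance]
        if candidates:
            best = min(candidates, key=lambda d: abs(d - ref))
            unmatched.remove(best)
            hits += 1
    detection_rate = hits / len(reference)
    insertion_rate = len(unmatched) / len(reference)
    return detection_rate, insertion_rate


# Toy data: three rises through 0.5, but only two hand-labelled onsets.
envelope = [0.1, 0.2, 0.8, 0.9, 0.3, 0.1, 0.7, 0.9, 0.2, 0.1, 0.6, 0.2]
detected = detect_onsets(envelope, threshold=0.5, min_gap=3)
rates = score_onsets(reference=[2, 6], detected=detected, tolerance=1)
```

On this toy input, both reference onsets are found and one extra onset is inserted, so a detector can reach a high detection rate while still producing insertions, which is why the two figures are reported together.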
Bibliographic reference. Meinedo, Hugo / Neto, Joao P. / Almeida, Luis B. (1999): "Syllable onset detection applied to the Portuguese language", In EUROSPEECH'99, 81-84.