8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Tagging Syllable Boundaries with Joint N-Gram Models

Helmut Schmid, Bernd Möbius, Julia Weidenkaff

University of Stuttgart, Germany

This paper presents a statistical method for the segmentation of words into syllables which is based on a joint n-gram model. Our system assigns syllable boundaries to phonetically transcribed words. The syllabification task was formulated as a tagging task. The syllable tagger was trained on syllable-annotated phone sequences. In an evaluation using ten-fold cross-validation, the system correctly predicted the syllabification of German words with an accuracy by word of 99.85%, which clearly exceeds results previously reported in the literature. The best performance was observed for a context size of five preceding phones. A detailed qualitative error analysis suggests that a further reduction of the error rate by up to 90% is possible by eliminating inconsistencies in the training database.

Full Paper

Bibliographic reference.  Schmid, Helmut / Möbius, Bernd / Weidenkaff, Julia (2007): "Tagging syllable boundaries with joint n-gram models", In INTERSPEECH-2007, 2857-2860.