Second ESCA/IEEE Workshop on Speech Synthesis
September 12-15, 1994
Development of multiple synthesis systems requires multiple transcribed speech databases. Here we explore an automatic technique for speech segmentation into phonetic segments applied to an Italian single speaker database. The output segmentation is compared to manual segmentations by two human transcribers. The performance is very good on voiced stop to vowel boundaries and unvoiced fricative to vowel boundaries, while vowel to vowel and voiced fricative to vowel boundaries are estimated less accurately.
Bibliographic reference. Ljolje, Andrej / Hirschberg, Julia / Santen, Jan P. H. van (1994): "Automatic speech segmentation for concatenative inventory selection", In SSW2-1994, 93-96.