ISCA Archive Eurospeech 1999

High-accuracy automatic segmentation

Jan P. H. van Santen, Richard W. Sproat

We propose a system for automatic segmentation: automatically determining the boundaries between phonetic segments in a speech wave, given a phonetic transcription. The system applies edge detectors to various speech representations; both the detectors and the representations are optimized for each diphone or diphone class. The output of these detectors, which includes spuriously detected edges, is then combined with alternative pronunciations generated by rule from the canonical pronunciation. The final output is generated with lowest-cost path algorithms applied to finite state transducers.
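
The lowest-cost path step can be illustrated with a small sketch. This is not the authors' implementation: it replaces finite state transducers with a plain dynamic program over a lattice of candidate edges, and it ignores alternative pronunciations. The function name `pick_boundaries`, the per-edge costs, the expected gaps between boundaries, and the `weight` parameter are all assumptions made for illustration.

```python
from typing import List, Tuple


def pick_boundaries(
    candidates: List[Tuple[float, float]],
    expected_gaps: List[float],
    weight: float = 1.0,
) -> List[float]:
    """Select len(expected_gaps) + 1 candidate edges, in time order, at minimal cost.

    candidates: (time_in_seconds, edge_cost) pairs from hypothetical edge
        detectors; a lower cost means a more convincing edge. Spurious edges
        are expected and should simply end up unselected.
    expected_gaps: assumed duration of each segment between two boundaries.
    weight: assumed penalty per second of deviation from an expected gap.
    """
    cands = sorted(candidates)
    m = len(cands)
    n = len(expected_gaps) + 1  # number of boundaries to place
    if n > m:
        raise ValueError("fewer candidate edges than boundaries")

    inf = float("inf")
    # best[k][j]: minimal cost of placing boundaries 0..k with boundary k at cands[j]
    best = [[inf] * m for _ in range(n)]
    back = [[-1] * m for _ in range(n)]
    for j in range(m):
        best[0][j] = cands[j][1]

    for k in range(1, n):
        for j in range(k, m):
            t_j, cost_j = cands[j]
            for i in range(k - 1, j):
                if best[k - 1][i] == inf:
                    continue
                gap_penalty = weight * abs((t_j - cands[i][0]) - expected_gaps[k - 1])
                total = best[k - 1][i] + gap_penalty + cost_j
                if total < best[k][j]:
                    best[k][j] = total
                    back[k][j] = i

    # Trace the lowest-cost path back from the cheapest final boundary.
    j = min(range(m), key=lambda idx: best[n - 1][idx])
    path = []
    for k in range(n - 1, -1, -1):
        path.append(cands[j][0])
        j = back[k][j]
    return path[::-1]


# Four detected edges, one of which (0.15 s) is spurious for a three-boundary utterance.
edges = [(0.10, 0.2), (0.15, 0.1), (0.30, 0.3), (0.42, 0.2)]
boundaries = pick_boundaries(edges, expected_gaps=[0.20, 0.10], weight=10.0)
```

In this toy input the edge at 0.15 s has the lowest detector cost, yet the duration penalty makes the path through it more expensive overall, so it is left out of the selected boundaries.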

doi: 10.21437/Eurospeech.1999-620

Cite as: Santen, J.P.H.v., Sproat, R.W. (1999) High-accuracy automatic segmentation. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2809-2812, doi: 10.21437/Eurospeech.1999-620

  author={Jan P. H. van Santen and Richard W. Sproat},
  title={{High-accuracy automatic segmentation}},
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},