ISCA Archive Eurospeech 2001

Local refinement of phonetic boundaries: a general framework and its application using different transition models

Doroteo Torre Toledano, Luis A. Hernández Gómez

Over the last few years we have been experimenting with an automatic phonetic segmentation and labeling system based on a modified HMM phonetic recognizer followed by a local phonetic boundary refinement system. During this period we have used different approaches for the local refinement, including fuzzy rules and neural networks. In this paper we present a unified framework for the local refinement of phonetic boundaries that has allowed us to thoroughly evaluate and compare these approaches, along with a further one based on Gaussian mixture models. Results show that neural networks outperform the other approaches in speaker-dependent mode, achieving a precision almost equal to that of manual segmentation. In speaker-independent mode, however, neural networks and fuzzy rules achieve almost the same performance, slightly worse than manual segmentation.


doi: 10.21437/Eurospeech.2001-397

Cite as: Toledano, D.T., Gómez, L.A.H. (2001) Local refinement of phonetic boundaries: a general framework and its application using different transition models. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 1695-1698, doi: 10.21437/Eurospeech.2001-397

@inproceedings{toledano01_eurospeech,
  author={Doroteo Torre Toledano and Luis A. Hernández Gómez},
  title={{Local refinement of phonetic boundaries: a general framework and its application using different transition models}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={1695--1698},
  doi={10.21437/Eurospeech.2001-397}
}