EUROSPEECH 2003 - INTERSPEECH 2003
This paper presents a segmental duration model for European Portuguese, intended for text-to-speech (TTS) synthesis. The model is a feed-forward neural network trained with the back-propagation algorithm; its inputs are a set of phonological and contextual features extracted automatically from the text. The relative importance of each feature is presented, both in terms of its correlation with segmental durations and of the improvement it brings to the model's performance. Finally, the model is evaluated objectively and, through a perceptual test, subjectively.
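As an illustration only, the following minimal sketch shows the general technique named in the abstract: a feed-forward network trained by back-propagation to map per-segment features to a duration. It is not the authors' implementation. The abstract does not specify the architecture, so the single hidden layer, the tanh activation, the squared-error loss, the three input features (`is_vowel`, `is_stressed`, `position`) and the synthetic data are all assumptions made for the example.

```python
import math
import random

def init_network(n_in, n_hidden, rng):
    """Assumed architecture: one tanh hidden layer, one linear output unit."""
    return {
        "w1": [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)],
        "b1": [0.0] * n_hidden,
        "w2": [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)],
        "b2": 0.0,
    }

def forward(net, x):
    """Return the hidden activations and the predicted duration."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(net["w1"], net["b1"])]
    out = sum(w * h for w, h in zip(net["w2"], hidden)) + net["b2"]
    return hidden, out

def train_step(net, x, target, lr):
    """One stochastic back-propagation update for the loss 0.5 * (out - target)^2."""
    hidden, out = forward(net, x)
    err = out - target  # derivative of the loss with respect to the output
    for j, h in enumerate(hidden):
        # Propagate the error through the tanh unit, using w2 before it is updated.
        grad_h = err * net["w2"][j] * (1.0 - h * h)
        net["w2"][j] -= lr * err * h
        for i, xi in enumerate(x):
            net["w1"][j][i] -= lr * grad_h * xi
        net["b1"][j] -= lr * grad_h
    net["b2"] -= lr * err

def mean_squared_error(net, data):
    return sum((forward(net, x)[1] - t) ** 2 for x, t in data) / len(data)

def make_toy_data(rng, n):
    """Synthetic segments: hypothetical features, durations in arbitrary normalised units."""
    data = []
    for _ in range(n):
        is_vowel = float(rng.random() < 0.5)
        is_stressed = float(rng.random() < 0.3)
        position = rng.random()  # relative position of the segment in the phrase
        duration = 0.3 + 0.3 * is_vowel + 0.2 * is_vowel * is_stressed + 0.1 * position
        data.append(([is_vowel, is_stressed, position], duration))
    return data

rng = random.Random(0)
data = make_toy_data(rng, 200)
net = init_network(n_in=3, n_hidden=4, rng=rng)

mse_before = mean_squared_error(net, data)
for epoch in range(200):
    rng.shuffle(data)
    for x, t in data:
        train_step(net, x, t, lr=0.05)
mse_after = mean_squared_error(net, data)

print("MSE before training:", mse_before)
print("MSE after training:", mse_after)
```

In a real system the toy generator would be replaced by features extracted from text and durations measured in a labelled speech corpus, and the error would be reported on held-out data rather than on the training set.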
Bibliographic reference. Teixeira, Joao Paulo / Freitas, Diamantino (2003): "Segmental durations predicted with a neural network", In EUROSPEECH-2003, 169-172.