ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Automatic modeling of duration in a Spanish text-to-speech system using neural networks

R. Córdoba, J. A. Vallejo, J. M. Montero, J. Gutierrez-Arriola, M. A. López, Juan Manuel Pardo

Accurate prediction of segmental duration from text in a text-tospeech system is difficult for several reasons. One specially relevant is the great quantity of contextual factors that affect timing and how to model them. There are many parameters that affect duration, but not all of them are always relevant. We present a complete environment in which to decide which parameters are more relevant in different situations and the best way to code them. The system is based in a neural network absolutely configurable, and the main effort is made in the parameters to be used, including the contextual effects using windows of variable length.


doi: 10.21437/Eurospeech.1999-367

Cite as: Córdoba, R., Vallejo, J.A., Montero, J.M., Gutierrez-Arriola, J., López, M.A., Pardo, J.M. (1999) Automatic modeling of duration in a Spanish text-to-speech system using neural networks. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1619-1622, doi: 10.21437/Eurospeech.1999-367

@inproceedings{cordoba99_eurospeech,
  author={R. Córdoba and J. A. Vallejo and J. M. Montero and J. Gutierrez-Arriola and M. A. López and Juan Manuel Pardo},
  title={{Automatic modeling of duration in a Spanish text-to-speech system using neural networks}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={1619--1622},
  doi={10.21437/Eurospeech.1999-367}
}