Synthesis systems, based on unit selection from databases with a large number of unit examples from different phonetic and prosodic contexts, have been shown to allow a high quality speech synthesis [1]. One of the dificult problems associated with this kind of synthesis is determining the strategy for unit selection, and tuning its parameters. In this work we suggest a way to extend the usefulness of two existing training methods, by using phoneme pairs as the basic comparison unit. Using unit pairs is shown to significantly increase the efifciency of the exhaustive weight search training method on one hand, and refining the regression weight training method on the other.
Cite as: Meron, Y., Hirose, K. (1999) Efficient weight training for selection based synthesis. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2319-2322, doi: 10.21437/Eurospeech.1999-506
@inproceedings{meron99_eurospeech, author={Yoram Meron and Keikichi Hirose}, title={{Efficient weight training for selection based synthesis}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={2319--2322}, doi={10.21437/Eurospeech.1999-506} }