State-of-the-art unit-selection text-to-speech systems currently produce very natural synthetic speech, at the price however of a costly and time-consuming voice creation process. We report here an extensive perceptual evaluation of several voice creation strategies, and conclude with a novel 1- day process giving access to high quality TTS voices.
Index Terms: speech synthesis, unit selection, vocalic sandwich, script design, rushes, segmentation, evaluation
Cite as: Cadic, D., d'Alessandro, C. (2010) High quality TTS voices within one day. Proc. 7th ISCA Workshop on Speech Synthesis (SSW 7), 288-293
@inproceedings{cadic10_ssw, author={Didier Cadic and Christophe d'Alessandro}, title={{High quality TTS voices within one day}}, year=2010, booktitle={Proc. 7th ISCA Workshop on Speech Synthesis (SSW 7)}, pages={288--293} }