INTERSPEECH 2006 - ICSLP
The level of quality that can be achieved in concatenative text-to-speech synthesis depends, among other things, on a judicious segmentation of all units in the underlying unit selection inventory. We have recently advocated the iterative refinement of unit boundaries based on a data-driven feature extraction framework separately optimized for each boundary region . This paper presents the formal proof of convergence of the iterative algorithm, as well as a detailed analysis of its potential benefits for concatenative TTS synthesis. A formal listening test, in particular, underscores the practical viability of the approach for unit boundary optimization.
Bibliographic reference. Bellegarda, Jerome R. (2006): "Further developments in LSM-based boundary training for unit selection TTS", In INTERSPEECH-2006, paper 1142-Tue3BuP.7.