Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Further Developments in LSM-Based Boundary Training for Unit Selection TTS

Jerome R. Bellegarda

Apple Computer Inc., USA

The level of quality that can be achieved in concatenative text-to-speech synthesis depends, among other things, on a judicious segmentation of all units in the underlying unit selection inventory. We have recently advocated the iterative refinement of unit boundaries based on a data-driven feature extraction framework separately optimized for each boundary region [1]. This paper presents a formal proof of convergence of the iterative algorithm, as well as a detailed analysis of its potential benefits for concatenative TTS synthesis. In particular, a formal listening test underscores the practical viability of the approach for unit boundary optimization.

Bibliographic reference. Bellegarda, Jerome R. (2006): "Further developments in LSM-based boundary training for unit selection TTS", in INTERSPEECH-2006, paper 1142-Tue3BuP.7.