INTERSPEECH 2008
9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Synthesis by Generation and Concatenation of Multiform Segments

Vincent Pollet, Andrew Breen

Nuance Communications, Belgium

Machine generated speech can be produced in different ways however there are two basic methods for synthesizing speech in widespread use. One method generates speech from models, while the other method concatenates pre-stored speech segments. This paper presents a speech synthesis technique where these two basic synthesis methods are combined in a statistical framework. Synthetic speech is constructed by generation and concatenation of so-called "multiform segments". Multiform segments are different speech signal representations; synthesis models, templates and synthesis models augmented with template information. An evaluation of the multiform segment synthesis technique shows improvements over traditional concatenative methods of synthesis.

Full Paper

Bibliographic reference.  Pollet, Vincent / Breen, Andrew (2008): "Synthesis by generation and concatenation of multiform segments", In INTERSPEECH-2008, 1825-1828.