Interspeech'2005 - Eurospeech
This paper presents a Non-Uniform Units selection-based Text- To-Speech synthesizer. Nowadays, systems use prosodic models that do not allow the prosody to vary as far as we should hope, involving a listening comfort degradation. Our system has the advantage to avoid the using of prosodic model. Speech units selection builds its features set exclusively from the linguistic information generated by the natural language analysis. We also present an original method to automatically weight these features. Therefore, selected units are not restricted by a predetermined prosody. With only using linguistic features, we obtain a various prosody and the units concatenation is performed without resort to heavy signal processing.
Bibliographic reference. Colotte, Vincent / Beaufort, Richard (2005): "Linguistic features weighting for a text-to-speech system without prosody model", In INTERSPEECH-2005, 2549-2552.