EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Text-to-Speech Scripting Interface for Appropriate Vocalisation of e-Texts

Gerasimos Xydas, Georgios Kouroupetroglou

University of Athens, Greece

Electronic texts carry important meta-information (such as tags in HTML) that most current Text-to-Speech (TtS) systems ignore when producing speech. We propose an approach that exploits this meta-information to achieve a detailed auditory representation of an e-text. The e-Text to Speech and Audio (e-TSA) Composer has been designed and developed as an XML-based scripting framework that existing TtS systems can adopt with minor or major modifications. It provides a mechanism for creating scripts that combine elements from e-texts and TtS systems. The e-TSA Composer can manipulate the behaviour of a TtS system (e.g. the applied prosody) in order to define the most appropriate vocalisation for specific e-texts.

Bibliographic reference. Xydas, Gerasimos / Kouroupetroglou, Georgios (2001): "Text-to-speech scripting interface for appropriate vocalisation of e-texts", in EUROSPEECH-2001, 2247-2250.