5th International Conference on Spoken Language Processing
This paper describes a high-quality Text-to-Speech system for Spanish, based on the concatenation of diphonemes with the MBR-PSOLA algorithm. Since it was designed as a substitute of natural voice for handicapped people, it must offer a easy to hear speech, with emotional and emphatic information embedded in it. This is obtained with the prosody generator, which uses a series of phonological patterns for phonic groups and a grammatical database to vary three speech parameters: pitch, amplitude and duration. This system accepts plain text, which can be complemented with data about emotions and emphasis.
Bibliographic reference. Lacunza, Fernando / Blanco, Yolanda (1998): "High quality text-to-speech system in Spanish for handicapped people", In ICSLP-1998, paper 0596.