Fifth ISCA ITRW on Speech Synthesis
June 14-16, 2004
This note describes a framework used to simulate three basic emotional styles by means of prosodic transplantation techniques applied to the output of a corpus based speech synthesis system. The target pitch profiles together with duration and energy constraints have been obtained applying simple rules inferred from the analysis of a small corpus, recorded in three emotional styles. Results of perceptual tests show that styles are well recognized even if the acoustical quality, in some cases, degrades.
Bibliographic reference. Zovato, Enrico / Pacchiotti, Alberto / Quazza, Silvia / Sandri, Stefano (2004): "Towards emotional speech synthesis: a rule based approach", In SSW5-2004, 219-220.