We describe the attempt to synthesize emotional speech with a concatenative speech synthesizer using a parameter space covering not only f0, duration and amplitude, but also voice quality parameters, spectral energy distribution, harmonics-to-noise ratio, and articulatory precision. The application of these extended parameter set offers the possibility to combine the high segmental quality of concatenative synthesis with a wider range of control settings needed for the synthesis of natural affected speech.
Cite as: Rank, E., Pirker, H. (1998) Generating emotional speech with a concatenative synthesizer. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0975, doi: 10.21437/ICSLP.1998-134
@inproceedings{rank98_icslp, author={Erhard Rank and Hannes Pirker}, title={{Generating emotional speech with a concatenative synthesizer}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0975}, doi={10.21437/ICSLP.1998-134} }