INTERSPEECH 2013
14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Analysis and Synthesis of Shouted Speech

Tuomo Raitio (1), Antti Suni (2), Jouni Pohjalainen (1), Manu Airaksinen (1), Martti Vainio (2), Paavo Alku (1)

(1) Aalto University, Finland
(2) University of Helsinki, Finland

In this study, the acoustic properties of shouted speech are analyzed in relation to normal speech, and various synthesis techniques for shouting are investigated. The analysis shows large differences between the two styles, which induces difficulties in synthesis. Analysis-synthesis experiments show that the use of spectral estimation methods that are not biased by the sparse harmonics of shouted speech is beneficial. The synthesis of shouting is performed through adaptation and voice conversion. Subjective evaluation of synthesis reveals that, despite quality degradation, the impression of shouting and use of vocal effort is fairly well preserved. In addition, the use of specific spectral estimation methods is found to be beneficial also in adaptation.

Full Paper

Bibliographic reference.  Raitio, Tuomo / Suni, Antti / Pohjalainen, Jouni / Airaksinen, Manu / Vainio, Martti / Alku, Paavo (2013): "Analysis and synthesis of shouted speech", In INTERSPEECH-2013, 1544-1548.