In this study, the acoustic properties of shouted speech are analyzed in relation to normal speech, and various synthesis techniques for shouting are investigated. The analysis shows large differences between the two styles, which induces difficulties in synthesis. Analysis-synthesis experiments show that the use of spectral estimation methods that are not biased by the sparse harmonics of shouted speech is beneficial. The synthesis of shouting is performed through adaptation and voice conversion. Subjective evaluation of synthesis reveals that, despite quality degradation, the impression of shouting and use of vocal effort is fairly well preserved. In addition, the use of specific spectral estimation methods is found to be beneficial also in adaptation.
Bibliographic reference. Raitio, Tuomo / Suni, Antti / Pohjalainen, Jouni / Airaksinen, Manu / Vainio, Martti / Alku, Paavo (2013): "Analysis and synthesis of shouted speech", In INTERSPEECH-2013, 1544-1548.