ISCA Archive Interspeech 2013

Analysis and synthesis of shouted speech

Tuomo Raitio, Antti Suni, Jouni Pohjalainen, Manu Airaksinen, Martti Vainio, Paavo Alku

In this study, the acoustic properties of shouted speech are analyzed in relation to normal speech, and various synthesis techniques for shouting are investigated. The analysis shows large differences between the two styles, which induce difficulties in synthesis. Analysis-synthesis experiments show that the use of spectral estimation methods that are not biased by the sparse harmonics of shouted speech is beneficial. The synthesis of shouting is performed through adaptation and voice conversion. Subjective evaluation of the synthesis reveals that, despite quality degradation, the impression of shouting and the use of vocal effort are fairly well preserved. In addition, the use of specific spectral estimation methods is also found to be beneficial in adaptation.


doi: 10.21437/Interspeech.2013-391

Cite as: Raitio, T., Suni, A., Pohjalainen, J., Airaksinen, M., Vainio, M., Alku, P. (2013) Analysis and synthesis of shouted speech. Proc. Interspeech 2013, 1544-1548, doi: 10.21437/Interspeech.2013-391

@inproceedings{raitio13_interspeech,
  author={Tuomo Raitio and Antti Suni and Jouni Pohjalainen and Manu Airaksinen and Martti Vainio and Paavo Alku},
  title={{Analysis and synthesis of shouted speech}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={1544--1548},
  doi={10.21437/Interspeech.2013-391}
}