ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Controlling “shout” expression in a Japanese POP singing performance: analysis and suppression study

Yuri Nishigaki, Ken-Ichi Sakakibara, Masanori Morise, Ryuichi Nisimura, Toshio Irino, Hideki Kawahara

Degree of "shout" singing performance is effectively controlled by combining global spectral shape equalization, peak cancellation in frequency modulation spectrum of F0 trajectory, and synchronized shape-modulation of voice spectral envelope. This "shout-reduction" processing is based on a symmetry-based F0 extractor with fine temporal resolution, a temporally stable representation of instantaneous frequency of periodic signals, and the TANDEM-STRAIGHT, a speech analysis, modification and resynthesis framework. The proposed procedure successfully converted an expressive Japanese POP song performance with "shout" into a plain performance without damaging original naturalness. Possibility of adding artificial "shout" to plain performance is also discussed.


doi: 10.21437/Interspeech.2013-649

Cite as: Nishigaki, Y., Sakakibara, K.-I., Morise, M., Nisimura, R., Irino, T., Kawahara, H. (2013) Controlling “shout” expression in a Japanese POP singing performance: analysis and suppression study. Proc. Interspeech 2013, 2905-2909, doi: 10.21437/Interspeech.2013-649

@inproceedings{nishigaki13_interspeech,
  author={Yuri Nishigaki and Ken-Ichi Sakakibara and Masanori Morise and Ryuichi Nisimura and Toshio Irino and Hideki Kawahara},
  title={{Controlling “shout” expression in a Japanese POP singing performance: analysis and suppression study}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2905--2909},
  doi={10.21437/Interspeech.2013-649}
}