ISCA Archive SSW 2010

An HMM-based singing style modeling system for singing voice synthesizers

Keijiro Saino, Makoto Tachibana, Hideki Kenmochi

This paper describes a statistical method for modeling singing styles. In this system, singing expression parameters consisting of melody and dynamics, which are derived from fundamental frequency (F0) and power, are modeled by context-dependent hidden Markov models (HMMs). The modeling method is tailored to these parameters. Because the parameters we focus on are common to singing synthesizers in general, parameters generated from the trained models may be applicable to many such synthesizers. As a result, parameters that can produce an “expressive” synthesized sound are generated automatically from the trained models using score data of arbitrary songs. In the experiment, we trained singing style models on recorded singing voice with a highly expressive style. Parameters generated for songs not included in the training data were applied to our singing synthesizer, VOCALOID. The style was well perceived in the synthesized sound, which retained sufficient naturalness.

Index Terms: singing voice synthesis, singing style, HMM


Cite as: Saino, K., Tachibana, M., Kenmochi, H. (2010) An HMM-based singing style modeling system for singing voice synthesizers. Proc. 7th ISCA Workshop on Speech Synthesis (SSW 7), 252-257

@inproceedings{saino10_ssw,
  author={Keijiro Saino and Makoto Tachibana and Hideki Kenmochi},
  title={{An HMM-based singing style modeling system for singing voice synthesizers}},
  year=2010,
  booktitle={Proc. 7th ISCA Workshop on Speech Synthesis (SSW 7)},
  pages={252--257}
}