A statistical parametric approach to singing voice synthesis based on hidden Markov models (HMMs) has been developed over the last few years. In this approach, the spectrum, excitation, and duration of singing voices are modeled simultaneously with context-dependent HMMs, and waveforms are generated from the HMMs themselves. In December 2009, we started a free online singing voice synthesis service called “Sinsy.” Users can obtain synthesized singing voices by uploading musical scores represented in MusicXML to the Sinsy website. This paper describes recent developments of Sinsy in detail.
Index Terms: HMM-based speech synthesis, singing voice synthesis
Cite as: Oura, K., Mase, A., Yamada, T., Muto, S., Nankaku, Y., Tokuda, K. (2010) Recent development of the HMM-based singing voice synthesis system — Sinsy. Proc. 7th ISCA Workshop on Speech Synthesis (SSW 7), 211-216
@inproceedings{oura10_ssw, author={Keiichiro Oura and Ayami Mase and Tomohiko Yamada and Satoru Muto and Yoshihiko Nankaku and Keiichi Tokuda}, title={{Recent development of the HMM-based singing voice synthesis system — Sinsy}}, year=2010, booktitle={Proc. 7th ISCA Workshop on Speech Synthesis (SSW 7)}, pages={211--216} }