INTERSPEECH 2013
14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Evaluation of a Singing Voice Conversion Method Based on Many-to-Many Eigenvoice Conversion

Hironori Doi (1), Tomoki Toda (1), Tomoyasu Nakano (2), Masataka Goto (2), Satoshi Nakamura (1)

(1) NAIST, Japan
(2) AIST, Japan

In this paper, we evaluate our proposed singing voice conversion method from various perspectives. To enable singers to freely control their voice timbre of singing voice, we have proposed a singing voice conversion method based on many-to-many eigenvoice conversion (EVC) that enables to convert the voice timbre of an arbitrary source singer into that of another arbitrary target singer using a probabilistic model. Furthermore, to easily develop training data consisting of multiple parallel data sets between a single reference singer and many other singers, a technique for efficiently and effectively generating the parallel data sets from nonparallel singing voice data sets of many singers using a singingto- singing synthesis system have been proposed. However, we have never conducted sufficient investigations into the effectiveness of these proposed methods. In this paper, we conduct both objective and subjective evaluations to carefully investigate the effectiveness of proposed methods. Moreover, the differences between singing voice conversion and speaking voice conversion are also analyzed. Experimental results show that our proposed method succeeds in enabling people to control their own voice timbre by using only an extremely small amount of the target singing voice.

Full Paper

Bibliographic reference.  Doi, Hironori / Toda, Tomoki / Nakano, Tomoyasu / Goto, Masataka / Nakamura, Satoshi (2013): "Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion", In INTERSPEECH-2013, 1067-1071.