5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Hidden Markov Model Based Voice Conversion Using Dynamic Characteristics of Speaker

Eun-Kyoung Kim, Sangho Lee, Yung-Hwan Oh

Department of Computer Science Korea Advanced Institute of Science and Technology, Taejon, Korea

This paper proposes a new voice conversion technique based on hidden Markov model (HMM) for modeling of speaker's dynamic characteristics. The basic idea of this technique is to use state transition probability as speaker's dynamic characteristics and have conversion rule at each state of HMM. A couple of methods is developed for creating state-dependent conversion rule. One uses source speaker's spectral dynamics and the other uses target speaker's. The experimental results showed that the proposed methods have better performance than conventional VQ-method in both objective and subjective tests. The comparison of our two methods showed that the method using target speaker's dynamics is superior in listening test and produces more natural sound.

Full Paper

Bibliographic reference.  Kim, Eun-Kyoung / Lee, Sangho / Oh, Yung-Hwan (1997): "Hidden Markov model based voice conversion using dynamic characteristics of speaker", In EUROSPEECH-1997, 2519-2522.