10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Voice Conversion Using K-Histograms and Frame Selection

Alejandro José Uriz (1), Pablo Daniel Agüero (1), Antonio Bonafonte (2), Juan Carlos Tulli (1)

(1) Universidad Nacional de Mar del Plata, Argentina
(2) Universitat Politècnica de Catalunya, Spain

The goal of voice conversion systems is to modify the voice of a source speaker to be perceived as if it had been uttered by another specific speaker. Many approaches found in the literature work based on statistical models and introduce an oversmoothing in the target features. Our proposal is a new model that combines several techniques used in unit selection for text-to-speech and a non-gaussian transformation mathematical model. Subjective results support the proposed approach.

Full Paper

Bibliographic reference.  Uriz, Alejandro José / Agüero, Pablo Daniel / Bonafonte, Antonio / Tulli, Juan Carlos (2009): "Voice conversion using k-histograms and frame selection", In INTERSPEECH-2009, 1639-1642.