10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Electrolaryngeal Speech Enhancement Based on Statistical Voice Conversion

Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano

NAIST, Japan

This paper proposes a speaking-aid system for laryngectomees using GMM-based voice conversion that converts electrolaryngeal speech (EL speech) to normal speech. Because valid F0 information cannot be obtained from the EL speech, we have so far converted the EL speech to whispering. This paper conducts the EL speech conversion to normal speech using F0 counters estimated from the spectral information of the EL speech. In this paper, we experimentally evaluate these two types of output speech of our speaking-aid system from several points of view. The experimental results demonstrate that the converted normal speech is preferred to the converted whisper.

Full Paper

Bibliographic reference.  Nakamura, Keigo / Toda, Tomoki / Saruwatari, Hiroshi / Shikano, Kiyohiro (2009): "Electrolaryngeal speech enhancement based on statistical voice conversion", In INTERSPEECH-2009, 1431-1434.