ISCA Archive Interspeech 2009

Electrolaryngeal speech enhancement based on statistical voice conversion

Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano

This paper proposes a speaking-aid system for laryngectomees that uses GMM-based voice conversion to convert electrolaryngeal speech (EL speech) to normal speech. Because valid F0 information cannot be obtained from EL speech, we have so far converted EL speech to whisper. Here, we instead convert EL speech to normal speech using F0 contours estimated from the spectral information of the EL speech. We experimentally evaluate these two types of output speech from several points of view. The experimental results demonstrate that the converted normal speech is preferred to the converted whisper.


doi: 10.21437/Interspeech.2009-439

Cite as: Nakamura, K., Toda, T., Saruwatari, H., Shikano, K. (2009) Electrolaryngeal speech enhancement based on statistical voice conversion. Proc. Interspeech 2009, 1431-1434, doi: 10.21437/Interspeech.2009-439

@inproceedings{nakamura09_interspeech,
  author={Keigo Nakamura and Tomoki Toda and Hiroshi Saruwatari and Kiyohiro Shikano},
  title={{Electrolaryngeal speech enhancement based on statistical voice conversion}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1431--1434},
  doi={10.21437/Interspeech.2009-439}
}