ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Exemplar-based individuality-preserving voice conversion for articulation disorders in noisy environments

Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki

We present in this paper a noise robust voice conversion (VC) method for a person with an articulation disorder resulting from athetoid cerebral palsy. The movements of such speakers are limited by their athetoid symptoms, and their consonants are often unstable or unclear, which makes it difficult for them to communicate. In this paper, exemplar-based spectral conversion using Nonnegative Matrix Factorization (NMF) is applied to a voice with an articulation disorder in real noisy environments. In this paper, in order to deal with background noise, an input noisy source signal is decomposed into the clean source exemplars and noise exemplars by NMF. Also, to preserve the speaker's individuality, we use a combined dictionary that was constructed from the source speakerĀfs vowels and target speaker's consonants. The effectiveness of this method was confirmed by comparing its effectiveness with that of a conventional Gaussian Mixture Model (GMM)-based method.


doi: 10.21437/Interspeech.2013-323

Cite as: Aihara, R., Takashima, R., Takiguchi, T., Ariki, Y. (2013) Exemplar-based individuality-preserving voice conversion for articulation disorders in noisy environments. Proc. Interspeech 2013, 3637-3641, doi: 10.21437/Interspeech.2013-323

@inproceedings{aihara13_interspeech,
  author={Ryo Aihara and Ryoichi Takashima and Tetsuya Takiguchi and Yasuo Ariki},
  title={{Exemplar-based individuality-preserving voice conversion for articulation disorders in noisy environments}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={3637--3641},
  doi={10.21437/Interspeech.2013-323}
}