ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

The linear transformation of LF glottal waveforms for voice conversion

Arantza del Pozo, Steve Young

Most Voice Conversion (VC) systems exploit source-filter decomposition based on linear prediction (LP) to transform spectral envelopes, incurring as a result various issues related to the oversimplification of the LP voice source model. Whilst residual prediction methods can mitigate this problem, they cannot be used to modify voice source quality. In this paper, a system which employs linear transformations to convert both the spectral envelope and the LF glottal waveform is presented. Its performance is shown to be comparable to that of a state-of-the-art VC implementation in terms of speaker identity conversion but its output has better quality. In addition, it is also capable of transforming the quality of the voice source.


doi: 10.21437/Interspeech.2008-420

Cite as: Pozo, A.d., Young, S. (2008) The linear transformation of LF glottal waveforms for voice conversion. Proc. Interspeech 2008, 1457-1460, doi: 10.21437/Interspeech.2008-420

@inproceedings{pozo08_interspeech,
  author={Arantza del Pozo and Steve Young},
  title={{The linear transformation of LF glottal waveforms for voice conversion}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1457--1460},
  doi={10.21437/Interspeech.2008-420}
}