ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

A source-filter based adaptive harmonic model and its application to speech prosody modification

JeeSok Lee, Frank K. Soong, Hong-Goo Kang

This paper presents a source-filter based adaptive harmonic model (aHM) that can modify prosody of given speech signals. Although the conventional aHM generates a homogeneous replication of the input speech, it is not suitable for prosody modification since temporal and spectral information are interweaved. The proposed method overcomes such limitation by further decomposing the harmonic parameter extracted from aHM into source and filter related components. By applying source-filter structure to aHM, the proposed algorithm can modify pitch of the synthesized speech with introducing only minor degradation. Both objective and subjective test results show that the proposed algorithm can naturally manipulate pitch contour, of which performance is much better than conventional algorithms such as pitch synchronous overlap add (PSOLA) and speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT).


doi: 10.21437/Interspeech.2013-9

Cite as: Lee, J., Soong, F.K., Kang, H.-G. (2013) A source-filter based adaptive harmonic model and its application to speech prosody modification. Proc. Interspeech 2013, 39-43, doi: 10.21437/Interspeech.2013-9

@inproceedings{lee13_interspeech,
  author={JeeSok Lee and Frank K. Soong and Hong-Goo Kang},
  title={{A source-filter based adaptive harmonic model and its application to speech prosody modification}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={39--43},
  doi={10.21437/Interspeech.2013-9}
}