ISCA Archive Interspeech 2009

New method for delexicalization and its application to prosodic tagging for text-to-speech synthesis

Martti Vainio, Antti Suni, Tuomo Raitio, Jani Nurminen, Juhani Järvikivi, Paavo Alku

This paper describes a new, flexible delexicalization method based on a glottal-excited parametric speech synthesis scheme. The system uses inverse-filtered glottal flow and all-pole modelling of the vocal tract. The method makes it possible to retain and manipulate all relevant prosodic features of any kind of speech. Most importantly, these features include voice quality, which has not been properly modelled in earlier delexicalization methods. The method was tested in a prosodic tagging experiment aimed at providing word prominence data for a text-to-speech synthesis system. The experiment confirmed the usefulness of the method and further corroborated earlier evidence that linguistic factors influence the perception of prosodic prominence.


doi: 10.21437/Interspeech.2009-514

Cite as: Vainio, M., Suni, A., Raitio, T., Nurminen, J., Järvikivi, J., Alku, P. (2009) New method for delexicalization and its application to prosodic tagging for text-to-speech synthesis. Proc. Interspeech 2009, 1703-1706, doi: 10.21437/Interspeech.2009-514

@inproceedings{vainio09_interspeech,
  author={Martti Vainio and Antti Suni and Tuomo Raitio and Jani Nurminen and Juhani Järvikivi and Paavo Alku},
  title={{New method for delexicalization and its application to prosodic tagging for text-to-speech synthesis}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1703--1706},
  doi={10.21437/Interspeech.2009-514}
}