ISCA Archive Interspeech 2008

HMM-based Finnish text-to-speech system utilizing glottal inverse filtering

Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku

This paper describes an HMM-based speech synthesis system that utilizes glottal inverse filtering to generate natural-sounding synthetic speech. In the proposed system, speech is first parametrized into spectral and excitation features using a method based on glottal inverse filtering. The parameters are used to train an HMM system, and at synthesis time they are generated from the trained HMMs according to the text input. Glottal flow pulses extracted from real speech are used as the voice source, which is further modified according to the all-pole model parameters generated by the HMMs. Preliminary experiments show that the proposed system is capable of generating natural-sounding speech, and that its quality is clearly better than that of a system using a conventional impulse train excitation model.
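
The contrast the abstract draws between a pulse-based voice source and an impulse train can be illustrated with a minimal source-filter sketch. This is not the authors' implementation: the raised-cosine pulse is a hypothetical stand-in for a glottal flow pulse extracted from real speech, the single 500 Hz resonance is a toy all-pole filter rather than HMM-generated parameters, and the names `pulse_train` and `all_pole_filter` are invented here for illustration.

```python
import numpy as np

def pulse_train(pulse, f0, fs, n_samples):
    """Overlap-add one copy of `pulse` every round(fs / f0) samples."""
    period = int(round(fs / f0))
    out = np.zeros(n_samples + len(pulse))
    for start in range(0, n_samples, period):
        out[start:start + len(pulse)] += pulse
    return out[:n_samples]

def all_pole_filter(excitation, a):
    """Filter with 1 / A(z), where A(z) = 1 + a[1] z^-1 + ... and a[0] == 1."""
    y = np.zeros(len(excitation))
    for n in range(len(excitation)):
        acc = excitation[n]
        for k in range(1, len(a)):
            if n - k >= 0:
                acc -= a[k] * y[n - k]
        y[n] = acc
    return y

fs = 16000   # sampling rate in Hz (assumed)
f0 = 100.0   # fundamental frequency in Hz (assumed)
n = 1600     # 100 ms of signal

# Hypothetical stand-in for a glottal flow pulse (assumption, not real data).
pulse_len = 80
glottal_pulse = 0.5 * (1.0 - np.cos(2.0 * np.pi * np.arange(pulse_len) / pulse_len))

# Conventional excitation: a single unit impulse per pitch period.
impulse = np.array([1.0])

# Toy all-pole filter: one stable resonance at 500 Hz (assumption).
r, fc = 0.97, 500.0
a = np.array([1.0, -2.0 * r * np.cos(2.0 * np.pi * fc / fs), r * r])

speech_glottal = all_pole_filter(pulse_train(glottal_pulse, f0, fs, n), a)
speech_impulse = all_pole_filter(pulse_train(impulse, f0, fs, n), a)
```

Both signals share the same pitch and filter, so any difference between them comes from the excitation alone, which is the variable the abstract's comparison isolates.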


doi: 10.21437/Interspeech.2008-189

Cite as: Raitio, T., Suni, A., Pulakka, H., Vainio, M., Alku, P. (2008) HMM-based Finnish text-to-speech system utilizing glottal inverse filtering. Proc. Interspeech 2008, 1881-1884, doi: 10.21437/Interspeech.2008-189

@inproceedings{raitio08_interspeech,
  author={Tuomo Raitio and Antti Suni and Hannu Pulakka and Martti Vainio and Paavo Alku},
  title={{HMM-based Finnish text-to-speech system utilizing glottal inverse filtering}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1881--1884},
  doi={10.21437/Interspeech.2008-189}
}