ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Analysis of major factors of naturalness degradation in concatenative synthesis

Toshio Hirai, Hisashi Kawai, Minoru Tsuzaki, Nobuyuki Nishizawa

To effectively improve a speech synthesis system, it is important to find and focus on improving the modules whose effect on naturalness degradation in synthesized speech are the largest. In this paper, we describe the design of a perception experiment to measure the effect of each module separately. Synthesized speech stimuli whose intermediate information is modified during a synthesis process are used in the experiment. A perception experiment in which a Japanese concatenative speech synthesis system was evaluated revealed that the text processing module and a part of the feature prediction module (for the fundamental frequency) of the system were the major factors in degrading naturalness.


doi: 10.21437/Interspeech.2005-603

Cite as: Hirai, T., Kawai, H., Tsuzaki, M., Nishizawa, N. (2005) Analysis of major factors of naturalness degradation in concatenative synthesis. Proc. Interspeech 2005, 1925-1928, doi: 10.21437/Interspeech.2005-603

@inproceedings{hirai05_interspeech,
  author={Toshio Hirai and Hisashi Kawai and Minoru Tsuzaki and Nobuyuki Nishizawa},
  title={{Analysis of major factors of naturalness degradation in concatenative synthesis}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={1925--1928},
  doi={10.21437/Interspeech.2005-603}
}