ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Development of a formant-based analysis-synthesis system and generation of high quality liquid sounds of Japanese

Nobuyuki Nishizawa, Nobuaki Minematsu, Keikichi Hirose

Although flexible control of acoustic features is possible in formant-based speech synthesizers, their development requires precise estimation of parameters related to vocal tract and source. This requirement is difficult to satisfy and often results in limiting quality of the synthesized speech. The difficulty is derived from the fact that estimation of the parameters is a nonlinear problem. Therefore, the completely automatic estimation of the parameters is quite difficult and some approximations or manual modifications of parameters with a priori knowledge are required in the development. In this study, mainly to make the estimation more efficient and/or to assist developers doing the manual modifications of parameters, a formant-based analysissynthesis system is build. The system introduces pitchsynchronous acoustic analysis to reduce fluctuation of the estimated parameters. Experiments show that quality of synthetic speech of Japanese /r/ sounds is significantly improved by using the proposed system.


Cite as: Nishizawa, N., Minematsu, N., Hirose, K. (2000) Development of a formant-based analysis-synthesis system and generation of high quality liquid sounds of Japanese. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 725-728

@inproceedings{nishizawa00_icslp,
  author={Nobuyuki Nishizawa and Nobuaki Minematsu and Keikichi Hirose},
  title={{Development of a formant-based analysis-synthesis system and generation of high quality liquid sounds of Japanese}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 725-728}
}