ISCA Archive ICSLP 1994
ISCA Archive ICSLP 1994

Speech editor based on enhanced user-system interaction for high quality text-to-speech synthesis

Kazuo Hakoda, Tomohisa Hirokawa, Kenzo Itoh

This paper describes a new speech editor based on enhanced user-system interaction that produces high quality synthesized speech by using an advanced text-to-speech synthesis method. A prototype system is constructed on a work station with the Open Window system. Features of the prototype are that the operator can correct the faults of the text-to-speech synthesis method and produce high quality synthesized speech from input Japanese text. System operation has been optimized by adopting a real-time synthesizer and a GUI design based on mouse operations. A system evaluation confirms that character level correction is very effective for improving synthesized speech quality. The proposed system can be used to provide voice messages for a conventional digital audio response unit at low cost.


Cite as: Hakoda, K., Hirokawa, T., Itoh, K. (1994) Speech editor based on enhanced user-system interaction for high quality text-to-speech synthesis. Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 1775-1778

@inproceedings{hakoda94_icslp,
  author={Kazuo Hakoda and Tomohisa Hirokawa and Kenzo Itoh},
  title={{Speech editor based on enhanced user-system interaction for high quality text-to-speech synthesis}},
  year=1994,
  booktitle={Proc. 3rd International Conference on Spoken Language Processing (ICSLP 1994)},
  pages={1775--1778}
}