Third International Conference on Spoken Language Processing (ICSLP 94)
This paper describes a new speech editor, based on enhanced user-system interaction, that produces high-quality synthesized speech using an advanced text-to-speech synthesis method. A prototype system has been built on a workstation running the Open Window system. With the prototype, the operator can correct the faults of the text-to-speech synthesis method and so produce high-quality synthesized speech from input Japanese text. System operation has been optimized by adopting a real-time synthesizer and a mouse-driven GUI design. A system evaluation confirms that character-level correction is very effective for improving synthesized speech quality. The proposed system can be used to provide voice messages for a conventional digital audio response unit at low cost.
Bibliographic reference. Hakoda, Kazuo / Hirokawa, Tomohisa / Itoh, Kenzo (1994): "Speech editor based on enhanced user-system interaction for high quality text-to-speech synthesis", In ICSLP-1994, 1775-1778.