5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Generating Segment Durations in a Text-to-Speech System: A Hybrid Rule-Based/Neural Network Approach

G. Corrigan, N. Massey, O. Karaali

Speech Processing Laboratory, Motorola Inc., Schaumburg, IL, USA

A neural network is combined with rule firing information from a rule-based system to generate segment durations for a text-to-speech system. The system shows a slight improvement in performance over a neural network system without the rule firing information. Listeners judged speech synthesized with the generated segment durations to be of about the same quality as speech synthesized with segment durations extracted from natural speech.

Bibliographic reference. Corrigan, G. / Massey, N. / Karaali, O. (1997): "Generating segment durations in a text-to-speech system: a hybrid rule-based/neural network approach", In EUROSPEECH-1997, 2675-2678.