This paper outlines ProSynth, an approach to speech synthesis which takes a rich linguistic structure as central to the generation of natural-sounding speech. We start from the assumption that the speech signal is informationally rich, and that this acoustic richness reflects linguistic structural richness and underlies the percept of naturalness. Naturalness achieved by structural richness produces a perceptually robust signal intelligible in adverse listening conditions. ProSynth uses syntactic and phonological parses to model the fine acoustic-phonetic detail of real speech, segmentally, temporally and intonationally.
Cite as: Hawkins, S., House, J., Huckvale, M., Local, J., Ogden, R. (1998) Prosynth: an integrated prosodic approach to device-independent, natural-sounding speech synthesis. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0538, doi: 10.21437/ICSLP.1998-11
@inproceedings{hawkins98_icslp, author={Sarah Hawkins and Jill House and Mark Huckvale and John Local and Richard Ogden}, title={{Prosynth: an integrated prosodic approach to device-independent, natural-sounding speech synthesis}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0538}, doi={10.21437/ICSLP.1998-11} }