ISCA Archive ICSLP 2000
Limitations to concatenative speech synthesis

Nick Campbell

This paper discusses techniques for determining the linguistic needs for open-domain synthesis by concatenative methods, and reports on the design and evaluation of a tool for collecting and balancing a speech corpus automatically, in order to ensure optimal coverage of the sounds required for synthesis within a given task-domain. Synthetically-generated utterances are used to prompt speakers, and in-line acoustic analysis determines the prosodic as well as phonemic balance of the resulting speech during recording, re-prompting the speaker with textually modified versions if necessary, to elicit the desired articulation sequences. The closed-loop process, which incorporates human self-correction and evaluation, allows for more efficient collection of a balanced corpus for concatenative speech synthesis.
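The closed-loop idea described in the abstract can be illustrated with a minimal sketch. It is not the tool reported in the paper. It assumes that coverage is measured over diphones (adjacent phone pairs), that prompts are chosen greedily by how many missing units they add, and that a hypothetical `record(prompt, attempt)` callable stands in for the recording and in-line analysis step by returning the phone sequence the speaker actually produced. Prosodic balance and the textual modification of re-prompts are left out.

```python
def diphones(phones):
    """Return the adjacent phone pairs of one utterance."""
    return list(zip(phones, phones[1:]))


def pick_next_prompt(pool, covered, target):
    """Greedy choice (an assumption): the prompt adding the most missing units."""
    def gain(phones):
        return len((set(diphones(phones)) & target) - covered)

    best = max(pool, key=gain)
    return best if gain(best) > 0 else None


def collect(candidates, target, record, max_retries=2):
    """Prompt, analyse, and re-prompt until the target units are covered.

    `record` is a hypothetical stand-in for recording plus analysis:
    it takes (prompt, attempt) and returns the realised phone sequence.
    """
    covered, log = set(), []
    pool = [tuple(p) for p in candidates]
    while pool and not target <= covered:
        prompt = pick_next_prompt(pool, covered, target)
        if prompt is None:  # no remaining prompt adds anything new
            break
        pool.remove(prompt)
        wanted = (set(diphones(prompt)) & target) - covered
        for attempt in range(max_retries + 1):
            realised = set(diphones(record(prompt, attempt)))
            if wanted <= realised:  # speaker produced the desired sequences
                break
        # Count only what was actually realised, not what was prompted.
        covered |= realised & target
        log.append((prompt, attempt))
    return covered, log
```

In this sketch, coverage is updated from the realised phones, not the prompted ones, which is what makes the loop closed: an utterance that misses its intended units triggers another attempt instead of being counted as covered.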

Cite as: Campbell, N. (2000) Limitations to concatenative speech synthesis. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 416-419

  author={Nick Campbell},
  title={{Limitations to concatenative speech synthesis}},
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 416-419}