ISCA Archive ICSLP 2000

When will synthetic speech sound human: role of rules and data

Jan van Santen, Michael Macon, Andrew Cronk, John-Paul Hosom, Alexander Kain, Vincent Pagel, Johan Wouters

Text-to-speech synthesis research has moved away from building general purpose systems based on an understanding of human language and speech production towards building systems based on statistical algorithms applied to large text and speech corpora, and, recently, towards building such systems for specific domains. Despite substantial progress, the overall quality of even the best systems is often still inadequate for broad user acceptance in applications that cannot also be handled with simple phrase splicing. This tutorial paper analyzes which problems must be addressed to achieve the goal of generating natural-sounding speech in limited domains in a cost-effective way, and the roles of data and rules as we work towards solutions.


Cite as: Santen, J.v., Macon, M., Cronk, A., Hosom, J.-P., Kain, A., Pagel, V., Wouters, J. (2000) When will synthetic speech sound human: role of rules and data. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 402-409

@inproceedings{santen00_icslp,
  author={Jan van Santen and Michael Macon and Andrew Cronk and John-Paul Hosom and Alexander Kain and Vincent Pagel and Johan Wouters},
  title={{When will synthetic speech sound human: role of rules and data}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={3},
  pages={402--409}
}