ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Using same-language machine translation to create alternative target sequences for text-to-speech synthesis

Peter Cahill, Jinhua Du, Andy Way, Julie Carson-Berndsen

Modern speech synthesis systems attempt to produce speech utterances from an open domain of words. In some situations, the synthesiser will not have the appropriate units to pronounce some words or phrases accurately but it still must attempt to pronounce them. This paper presents a hybrid machine translation and unit selection speech synthesis system. The machine translation system was trained with English as the source and target language. Rather than the synthesiser only saying the input text as would happen in conventional synthesis systems, the synthesiser may say an alternative utterance with the same meaning. This method allows the synthesiser to overcome the problem of insufficient units in runtime.


doi: 10.21437/Interspeech.2009-408

Cite as: Cahill, P., Du, J., Way, A., Carson-Berndsen, J. (2009) Using same-language machine translation to create alternative target sequences for text-to-speech synthesis. Proc. Interspeech 2009, 1307-1310, doi: 10.21437/Interspeech.2009-408

@inproceedings{cahill09_interspeech,
  author={Peter Cahill and Jinhua Du and Andy Way and Julie Carson-Berndsen},
  title={{Using same-language machine translation to create alternative target sequences for text-to-speech synthesis}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1307--1310},
  doi={10.21437/Interspeech.2009-408}
}