The requirements of the computer telephony (CT) industry place conflicting demands on text-to-speech (TTS) systems. Multilingual functionality and high-quality output at telephone bandwidth require detailed linguistic and acoustic analysis. At the same time, the need for robustness, together with a high channel count and a small memory footprint, means that systems must be extremely efficient and databases must be kept small. We present a system that provides TTS for six languages, with 100 channels of highly natural output on a single DSP card.
Cite as: Monaghan, A., Kassaei, M., Luckin, M., Amador-Hernandez, M., Lowry, A., Faulkner, D., Sannier, F. (2001) Multilingual TTS for computer telephony: the aculab approach. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 513-516, doi: 10.21437/Eurospeech.2001-138
@inproceedings{monaghan01_eurospeech,
  author={Alex Monaghan and Mahmoud Kassaei and Mark Luckin and Mariscela Amador-Hernandez and Andrew Lowry and Dan Faulkner and Fred Sannier},
  title={{Multilingual TTS for computer telephony: the aculab approach}},
  year=2001,
  booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)},
  pages={513--516},
  doi={10.21437/Eurospeech.2001-138}
}