We describe a method of synthesising contextually appropriate intonation with limited domain unit selection voices. The method enables the natural language generation component of a dialogue system to specify its intonation choices via APML, an XML-based markup language. In a pilot study, we built an APML-aware limited domain voice for use in flight information dialogues, and carried out a perception experiment comparing the APML voice to a default version built using the same recordings without the additional structure. The intonation produced by the APML voice was judged significantly more contextually appropriate than that of the default voice. These results justified building a second voice with a much larger vocabulary, using an automated script generation algorithm.
Cite as: Baker, R., Clark, R.A.J., White, M. (2004) Synthesising contextually appropriate intonation in limited domains. Proc. 5th ISCA Workshop on Speech Synthesis (SSW 5), 91-96
@inproceedings{baker04_ssw, author={Rachel Baker and Robert A. J. Clark and Michael White}, title={{Synthesising contextually appropriate intonation in limited domains}}, year=2004, booktitle={Proc. 5th ISCA Workshop on Speech Synthesis (SSW 5)}, pages={91--96} }