This paper presents work in progress towards building a Xhosa speech synthesizer. HTS is being used for this purpose due to certain desirable properties. As a minority language, linguistic resources for Xhosa are limited despite a variety of impressionistic phonetic studies, prompting a minimalist approach and a preference for data-driven methods. Xhosa is an agglutinative language, and is also held to be a tonal language, which therefore requires morphological analysis and tonal information in order to generate intelligible speech. By taking into account more recent findings on the nature of Xhosa prosody, it appears that a minimalist approach that excludes tone information is possible. We implement the system using HTS. Such a data-driven TTS system is a useful tool to test various syntactic and other features in text that influence Xhosa prosody.
Cite as: Roux, J.C., Visagie, A.S. (2007) Data-driven approach to rapid prototyping Xhosa speech synthesis. Proc. 6th ISCA Workshop on Speech Synthesis (SSW 6), 143-147
@inproceedings{roux07_ssw, author={Justus C. Roux and Albert S. Visagie}, title={{Data-driven approach to rapid prototyping Xhosa speech synthesis}}, year=2007, booktitle={Proc. 6th ISCA Workshop on Speech Synthesis (SSW 6)}, pages={143--147} }