This paper discusses how a concatenative Kiswahili Text to Speech System (TTS) was developed based on the Festival Unit Selection Speech Synthesiser. It explains how important Kiswahili linguistic features such as phones, stress and intonation were modelled as inputs to the Festival engine. It also discusses the design, recording and segmentation of the speech database, beginning with text corpus collection and transcription. The choice of the speaker, which is crucial to realising a good TTS is discussed and also how the system was tested.
Cite as: Gakuru, M., Iraki, F.K., Tucker, R., Shalonova, K., Ngugi, K. (2005) Development of a Kiswahili text to speech system. Proc. Interspeech 2005, 1481-1484, doi: 10.21437/Interspeech.2005-522
@inproceedings{gakuru05_interspeech, author={Mucemi Gakuru and Frederick K. Iraki and Roger Tucker and Ksenia Shalonova and Kamanda Ngugi}, title={{Development of a Kiswahili text to speech system}}, year=2005, booktitle={Proc. Interspeech 2005}, pages={1481--1484}, doi={10.21437/Interspeech.2005-522} }