12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

A Cross-Lingual Approach to the Development of an HMM-Based Speech Synthesis System for Malay

Mumtaz B. Mustafa (1), Raja N. Ainon (1), Roziati Zainuddin (1), Zuraidah M. Don (1), Gerry Knowles (2)

(1) Universiti Malaya, Malaysia
(2) Lingenium Sdn. Bhd., Malaysia

This paper reports the development of an HMM-based speech synthesis system for Malay, an under-resourced language for which few resources, such as recorded speech and segmental labels, are available. We propose using resources cross-lingually to develop the system. We used the Festival English speech synthesis system to generate time-aligned phone transcriptions for Malay, using a specially constructed Malay grapheme-to-phoneme database and an English CART. These transcriptions, together with Malay recorded speech databases, were used for training and synthesis of Malay speech. Intelligibility and naturalness tests on the synthetic speech confirm the effectiveness of the proposed approach.
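To make the grapheme-to-phoneme step concrete, the following is a minimal sketch of how Malay spellings might be mapped onto English-style phone labels by greedy longest-match lookup. It is not the authors' database or Festival's API: the table `G2P`, the function `to_phones`, and every phone label in it are hypothetical and chosen only for illustration.

```python
# Toy grapheme-to-phone table: Malay spellings mapped to English-style phone labels.
# Every entry is an illustrative assumption, not taken from the paper's database.
G2P = {
    "ng": ["ng"],      # digraph, assumed to map to a single velar nasal label
    "ny": ["n", "y"],  # digraph, assumed to be approximated by two English phones
    "sy": ["sh"],      # digraph, assumed to map to an English "sh" label
    "a": ["aa"],
    "e": ["ax"],
    "i": ["iy"],
    "o": ["ow"],
    "u": ["uw"],
    "b": ["b"],
    "k": ["k"],
    "m": ["m"],
    "n": ["n"],
    "s": ["s"],
    "t": ["t"],
    "y": ["y"],
}

# Longest grapheme in the table, so digraphs are tried before single letters.
MAX_GRAPHEME_LEN = max(len(grapheme) for grapheme in G2P)


def to_phones(word):
    """Convert a word to a phone list by greedy longest-match table lookup."""
    word = word.lower()
    phones = []
    i = 0
    while i < len(word):
        # Try the longest candidate grapheme first, then shorter ones.
        for size in range(min(MAX_GRAPHEME_LEN, len(word) - i), 0, -1):
            chunk = word[i:i + size]
            if chunk in G2P:
                phones.extend(G2P[chunk])
                i += size
                break
        else:
            raise ValueError(f"no mapping for {word[i]!r} in {word!r}")
    return phones


if __name__ == "__main__":
    for example in ("bunga", "nyanyi", "syak"):
        print(example, to_phones(example))
```

Trying digraphs before single letters keeps a spelling such as "ng" from being split into two separate phones. A real system would also need the time alignment the abstract mentions, which this sketch does not attempt.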


Bibliographic reference. Mustafa, Mumtaz B. / Ainon, Raja N. / Zainuddin, Roziati / Don, Zuraidah M. / Knowles, Gerry (2011): "A cross-lingual approach to the development of an HMM-based speech synthesis system for Malay", In INTERSPEECH-2011, 3197-3200.