8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003


Cross-Lingual Pronunciation Modelling for Indonesian Speech Recognition

Terrence Martin (1), Torbjorn Svendsen (2), Sridha Sridharan (1)

(1) Queensland University of Technology, Australia
(2) Norwegian University of Science and Technology, Norway

The resources necessary to produce Automatic Speech Recognition systems for a new language are considerable, and for many languages these resources are not available. This emphasizes the need for generic techniques which overcome this data shortage. Indonesian is one language which suffers from this problem, and its population and importance suggest it could benefit from speech-enabled technology. Accordingly, we investigate using English acoustic models to recognize Indonesian speech. The mapping process, in which the symbolic representation of the source language acoustic models is equated to the target language phonetic units, has typically been achieved using one-to-one mapping techniques. This mapping method does not allow predictable allophonic variation to be incorporated in the lexicon. In this paper we therefore present the use of cross-lingual pronunciation modelling to extract context-dependent mapping rules, which are subsequently used to produce a more accurate cross-lingual lexicon.
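
The difference between a one-to-one mapping and context-dependent mapping rules can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the phone symbols, the rule format, the single word-final rule, and the function name map_word are hypothetical and are not taken from the paper, which derives its rules through pronunciation modelling rather than writing them by hand.

```python
from typing import Dict, List, Tuple

# Hypothetical one-to-one map: target (Indonesian) phone -> source (English) model.
ONE_TO_ONE: Dict[str, str] = {"a": "aa", "i": "iy", "k": "k", "t": "t", "n": "n"}

# Hypothetical context-dependent rules: (left, phone, right) -> source model.
# "*" matches any neighbour and "#" marks a word boundary.
RULES: Dict[Tuple[str, str, str], str] = {
    ("*", "k", "#"): "k_unreleased",  # invented rule: word-final /k/ gets its own model
}


def map_word(phones: List[str],
             rules: Dict[Tuple[str, str, str], str] = RULES,
             fallback: Dict[str, str] = ONE_TO_ONE) -> List[str]:
    """Map a target-language phone sequence to source-language model names.

    A context rule is used when one matches; otherwise the one-to-one
    mapping is applied, so the baseline behaviour is always available.
    """
    padded = ["#"] + list(phones) + ["#"]
    mapped = []
    for i in range(1, len(padded) - 1):
        left, phone, right = padded[i - 1], padded[i], padded[i + 1]
        # Try the most specific context first, then the wildcard variants.
        candidates = [(left, phone, right), ("*", phone, right), (left, phone, "*")]
        for key in candidates:
            if key in rules:
                mapped.append(rules[key])
                break
        else:
            mapped.append(fallback[phone])
    return mapped


if __name__ == "__main__":
    print(map_word(["t", "a", "k"]))  # word-final /k/ triggers the context rule
    print(map_word(["k", "a", "n"]))  # word-initial /k/ falls back to one-to-one
```

With a one-to-one table alone, both occurrences of /k/ would receive the same source model; the rule table lets the lexicon entry vary with phonetic context.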

Bibliographic reference. Martin, Terrence / Svendsen, Torbjorn / Sridharan, Sridha (2003): "Cross-lingual pronunciation modelling for Indonesian speech recognition", in EUROSPEECH-2003, 3125-3128.