11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Brazilian Portuguese Acoustic Model Training Based on Data Borrowing from Other Language

Kazuhiko Abe, Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura

National Institute of Information and Communications Technolog (NICT), Japan

This paper presents the acoustic modeling method for Portuguese speech recognizers. To improve the acoustic model, other language data are used to offset the lack of the model training data. In using this data-borrowing approach, we select training data with consideration given to the influence of the other language. A simple solution is to minimize the volume of data borrowed. We developed a data selection strategy based on two principles: the Phonetic Frequency Principle and Maximum Entropy Principle. Refining the acoustic model with this strategy, word accuracy is improved, especially words that contain a low-frequency phoneme.

Full Paper

Bibliographic reference.  Abe, Kazuhiko / Sakti, Sakriani / Isotani, Ryosuke / Kawai, Hisashi / Nakamura, Satoshi (2010): "Brazilian portuguese acoustic model training based on data borrowing from other language", In INTERSPEECH-2010, 861-864.