EUROSPEECH 2003 - INTERSPEECH 2003
A large vocabulary Taiwanese (Min-nan) speech recognition system is described in this paper. Due to the severe multiple pronunciation phenomenon in Taiwanese partly caused by tone sandhi, a statistical pronunciation modeling technique based on tonal features is used. This system is speaker independent. It was trained by a bi-lingual Mandarin/Taiwanese speech corpus to alleviate the lack of pure Taiwanese speech corpus. The searching network is constructed based on nodes of Chinese characters and results in the direct output Chinese character string. Experiments show that by using the approaches proposed in this paper, the character error rate can decrease significantly from 21.50% to 11.97%.
Bibliographic reference. Lyu, Dau-Cheng / Liang, Min-Siong / Chiang, Yuang-Chin / Hsu, Chun-Nan / Lyu, Ren-Yuan (2003): "Large vocabulary taiwanese (min-nan) speech recognition using tone features and statistical pronunciation modeling", In EUROSPEECH-2003, 1861-1864.