In this paper we develop hidden Markov model (HMM) based approaches to identify Chinese dialects spoken in Taiwan. This task can be aided by exploiting various characteristic features of Chinese spoken languages. The baseline system performs phonotactic analysis after the speech utterance is tokenized into a sequence of five broad phonetic classes. The sequential statistics of the resulting symbols are then used to distinguish one dialect from another. The second approach we tested is to incor-porate dialect-dependent phonotactic constraints into the phonetic tokenization rather than applying these con-straints after the broad phonetic classification is complete. These algorithms were evaluated using a multi-speaker speech corpus of text-independent spontaneous speech data. Simulation results indicate that the acoustic-phonotactic approach to dialect identification yields better performance with an average identification rate of 89.6%, compared to 70% for the baseline system.
Cite as: Tsai, W.-H., Chang, W.-W. (1999) Chinese dialect identification using an acoustic-phonotactic model. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 367-370, doi: 10.21437/Eurospeech.1999-95
@inproceedings{tsai99_eurospeech, author={Wuei-He Tsai and Wen-Whei Chang}, title={{Chinese dialect identification using an acoustic-phonotactic model}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={367--370}, doi={10.21437/Eurospeech.1999-95} }