ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

A monolingual semantic decoder based on word sense disambiguation for mixed language understanding

Xiaohu Liu, Pascale Fung, Chi Shun Cheung

In this paper, a new method for spoken mixed language understanding is presented. By mixed language, we mean that the words included in one sentence may come from different languages, a primary language and a secondary language. In conventional statistical semantic decoders, the conceptual structure is represented as a hidden Markov model, the decoding of the conceptual content of a sentence is carried out with the Viterbi algorithm. To handle mixed language, an unsupervised word sense disambiguation module is proposed to convert the secondary language words into the primary language. The approach is evaluated in the ATIS domain, where the primary language is English and we assume the secondary language is Chinese. The average accuracy of our extended semantic decoder is 26% higher than the accuracy of the baseline semantic decoder. The advantages of the extended semantic decoder are (1) it can handle mixed language input, and (2) it needs neither secondary language training data nor mixed language training data. The approach can be used for any main-secondary language pairs.


doi: 10.21437/Eurospeech.1999-444

Cite as: Liu, X., Fung, P., Cheung, C.S. (1999) A monolingual semantic decoder based on word sense disambiguation for mixed language understanding. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2011-2014, doi: 10.21437/Eurospeech.1999-444

@inproceedings{liu99e_eurospeech,
  author={Xiaohu Liu and Pascale Fung and Chi Shun Cheung},
  title={{A monolingual semantic decoder based on word sense disambiguation for mixed language understanding}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2011--2014},
  doi={10.21437/Eurospeech.1999-444}
}