ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Discrimination of task-related words for vocabulary design of spoken dialog systems

Akinori Ito, Toyomi Meguro, Shozo Makino, Motoyuki Suzuki

This paper describes a method used to determine if a specific word is related to a certain spoken dialog task. In most ordinary spoken dialog systems, only the words that are actually used to achieve the task are included in the vocabulary. Therefore, the system cannot recognize utterances that contain OOV words that are related to the task. Therefore, we developed a method for determining the words that are related to a specified task in order to augment the system's vocabulary. Our method is based on word similarity. We examined three similarities: word occurrence frequency on the Web, distance in a thesaurus and word similarity using LSA. The experiment revealed that the thesaurus-based and LSA-based methods have an OOV problem. To solve the problem, we developed a way to combine these two methods with theWeb-based method. In addition, we tried combining the methods using the AdaBoost algorithm.


doi: 10.21437/Interspeech.2008-65

Cite as: Ito, A., Meguro, T., Makino, S., Suzuki, M. (2008) Discrimination of task-related words for vocabulary design of spoken dialog systems. Proc. Interspeech 2008, 207-210, doi: 10.21437/Interspeech.2008-65

@inproceedings{ito08_interspeech,
  author={Akinori Ito and Toyomi Meguro and Shozo Makino and Motoyuki Suzuki},
  title={{Discrimination of task-related words for vocabulary design of spoken dialog systems}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={207--210},
  doi={10.21437/Interspeech.2008-65}
}