ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Fine keyword clustering using a thesaurus and example sentences for speech translation

Yumi Wakita, Kenji Matsui, Yoshinori Sagisaka

For robust speech translation, we propose a new language translation method in which speech recognition results are mapped to example sentences using keywords. In this method, the keyword clustering is used to cope with recognition errors and the wide variety of words that do not appear in the training corpus. Initial classes defined using only thesaurus are redefined by using the dependency between the keywords in limited number of example sentences. The effectiveness of our keyword clustering method is confirmed through example sentence search experiments. These experiments were done using keyword sets of (a) different sentences including keywords not in the example sentences and (b) recognition results those sentences in which recognition errors were obtained. Compared with the search method which uses keyword sets defined by using only a thesaurus, our proposed method offered improved search error rates.


Cite as: Wakita, Y., Matsui, K., Sagisaka, Y. (2000) Fine keyword clustering using a thesaurus and example sentences for speech translation. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 390-393

@inproceedings{wakita00_icslp,
  author={Yumi Wakita and Kenji Matsui and Yoshinori Sagisaka},
  title={{Fine keyword clustering using a thesaurus and example sentences for speech translation}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 390-393}
}