ISCA Archive ICSLP 2000

Retrieval of Mandarin broadcast news using spoken queries

Berlin Chen, Hsin-min Wang, Lin-shan Lee

Considering the monosyllabic structure of the Chinese language, a whole class of indexing features for retrieval of Mandarin broadcast news based on syllable-level statistical characteristics has been investigated previously. This paper presents improvements over those earlier results. The major differences are: (1) multi-scale character- and word-level indexing terms are integrated with the syllable-level information; (2) information cues from a contemporary newswire text corpus are used to create more accurate syllable indexing terms; (3) automatic document expansion, blind relevance feedback, and query expansion via a term association matrix are applied in retrieval. With all these schemes, average precision improves from 55.46% to 71.29%.
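
As a rough illustration of what syllable-level indexing can look like, the sketch below builds overlapping syllable n-grams from a syllable sequence and ranks documents by cosine similarity of term counts. This is only an assumed, simplified stand-in: the abstract does not specify the indexing terms, weighting, or retrieval model, and the function names (`syllable_ngrams`, `cosine`, `rank`), the toy pinyin syllables, and the choice of raw counts are all hypothetical.

```python
from collections import Counter
from math import sqrt


def syllable_ngrams(syllables, max_n=2):
    """Return overlapping syllable n-grams (n = 1..max_n) as tuples."""
    terms = []
    for n in range(1, max_n + 1):
        for i in range(len(syllables) - n + 1):
            terms.append(tuple(syllables[i:i + n]))
    return terms


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank(query_syllables, docs, max_n=2):
    """Rank document ids by similarity to the query, best first.

    `docs` maps a document id to its syllable sequence. Raw counts are
    used as weights here (an assumption made for brevity).
    """
    q = Counter(syllable_ngrams(query_syllables, max_n))
    scored = [
        (cosine(q, Counter(syllable_ngrams(d, max_n))), doc_id)
        for doc_id, d in docs.items()
    ]
    # Sort by descending score, breaking ties by document id.
    return [doc_id for _, doc_id in sorted(scored, key=lambda x: (-x[0], x[1]))]


# Toy usage with made-up syllable transcriptions.
docs = {"a": ["xin", "wen", "bao"], "b": ["tian", "qi"]}
ranking = rank(["xin", "wen"], docs)
```

Matching on syllable n-grams rather than words is one way such a system could tolerate recognition errors and word segmentation ambiguity in spoken queries and documents; the character- and word-level terms mentioned in the abstract would be added as further term types alongside these.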


Cite as: Chen, B., Wang, H.-m., Lee, L.-s. (2000) Retrieval of mandarin broadcast news using spoken queries. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 520-523

@inproceedings{chen00b_icslp,
  author={Berlin Chen and Hsin-min Wang and Lin-shan Lee},
  title={{Retrieval of mandarin broadcast news using spoken queries}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={1},
  pages={520--523}
}