This paper describes a robust speech document retrieval system that accepts keywords as voice input. To address misrecognition, the problem that inevitably arises when the input to the system is speech, we developed a novel method in which, before retrieval, unproductive keyword candidates are discarded by a grouping process that uses the similarity between words and the recognition scores of the keywords. In retrieval experiments, we used the proposed method to retrieve Japanese broadcast news documents from voice keyword input and showed its effectiveness.
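The grouping step summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the similarity measure, the grouping rule, or how recognition scores are combined, so everything below is an assumption made for illustration. In particular, `group_candidates`, `select_keywords`, `sim_threshold`, the single-link grouping rule, the "keep the group with the highest summed score" rule, and the toy similarity table are all hypothetical.

```python
from itertools import combinations


def group_candidates(candidates, similarity, sim_threshold=0.5):
    """Group keyword candidates by single-link similarity (assumed rule).

    candidates: dict mapping each recognized word to its recognition score.
    similarity: function (word_a, word_b) -> float; hypothetical measure.
    """
    words = list(candidates)
    parent = {w: w for w in words}  # union-find forest over the words

    def find(w):
        # Follow parent links to the group representative, halving paths.
        while parent[w] != w:
            parent[w] = parent[parent[w]]
            w = parent[w]
        return w

    # Merge the groups of any two words that are similar enough.
    for a, b in combinations(words, 2):
        if similarity(a, b) >= sim_threshold:
            parent[find(a)] = find(b)

    groups = {}
    for w in words:
        groups.setdefault(find(w), []).append(w)
    return list(groups.values())


def select_keywords(candidates, similarity, sim_threshold=0.5):
    """Keep the group with the highest summed recognition score (assumed rule).

    Words outside that group are treated as unproductive and discarded.
    """
    groups = group_candidates(candidates, similarity, sim_threshold)
    best = max(groups, key=lambda g: sum(candidates[w] for w in g))
    return sorted(best)


# Toy data (invented): recognizer output with scores, including misrecognitions.
candidates = {"election": 0.9, "vote": 0.7, "erection": 0.8, "boat": 0.6}

# Toy similarity table (invented): only "election" and "vote" are related.
TOY_SIM = {frozenset(("election", "vote")): 0.8}


def toy_similarity(a, b):
    return TOY_SIM.get(frozenset((a, b)), 0.0)


print(select_keywords(candidates, toy_similarity))
```

In this toy run, the acoustically confusable but semantically unrelated candidates end up in singleton groups and are dropped before retrieval, which is the general idea the abstract describes; the real system may group and score candidates quite differently.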
Cite as: Nishizaki, H., Nakagawa, S. (2000) A system for retrieving broadcast news speech documents using voice input keywords and similarity between words. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 1073-1076
@inproceedings{nishizaki00_icslp, author={Hiromitsu Nishizaki and Seiichi Nakagawa}, title={{A system for retrieving broadcast news speech documents using voice input keywords and similarity between words}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, volume={3}, pages={1073--1076} }