ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Expanded vector space model based on word space in cross media retrieval of news speech data

Seiichi Takao, Jun Ogata, Yasuo Ariki

News On Demand System using speech technology usually employs automatic speech transcriptions to retrieve the news data. In the retrieval, users specify a few keywords or sentences as a query and the related news data can be retrieved using the speech transcription. However when users can’t give a query clearly, a video shot of news program which users are watching will become a good query to retrieve the related news data. As one of such kinds of news data retrieval, we propose here to employ video captions as a query and to retrieve the related news data using speech transcription. We call this kind of retrieval as cross media retrieval due to its media cross over. Conventionally available method in cross media retrieval is standard cosine measure in vector space model. In this conventional method, there is a problem of impossibility of semantic level retrieval. To solve this problem, we propose here an expanded vector space model based on a word space. Experimental results found that the expanded vector space model based on the word space has superiority to the conventional vector space model.


Cite as: Takao, S., Ogata, J., Ariki, Y. (2000) Expanded vector space model based on word space in cross media retrieval of news speech data. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 1085-1088

@inproceedings{takao00_icslp,
  author={Seiichi Takao and Jun Ogata and Yasuo Ariki},
  title={{Expanded vector space model based on word space in cross media retrieval of news speech data}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 3, 1085-1088}
}