 |
2003 ISCA Workshop on
Multilingual Spoken Document Retrieval
(MSDR2003)
Hong Kong
April 4-5, 2003 |
 |
Issues in Speech-Based Retrieval of Video
H. J. Nock, G. Iyengar, C. Neti
IBM T. J. Watson Research Center,
Yorktown Heights, NY, USA
This paper discusses issues arising when applying the IBM
Audio-Indexing System to retrieval of video. Issues discussed
include the relationship between speech transcription
accuracy and retrieval performance, query processing
schemes and the critical problem of mapping between cues
in speech and the relevant video shots. The temporal relationship
between the occurrence of cues in speech transcripts
and relevant shots is quantified and then simple schemes
for performing this mapping are described and evaluated.
Experiments demonstrate the promise of more sophisticated
schemes involving up-front video ranking and one possible
implementation is discussed. Techniques are evaluated using
the TREC-2002 Video Track queries and corpus, comprising
a total of 68.45 hours of video.
Full Paper
Bibliographic reference.
Nock, H. J. / Iyengar, G. / Neti, C. (2003):
"Issues in speech-based retrieval of video",
In MSDR-2003, 67-72.