Speech Recognition and Intrinsic Variation (SRIV2006)

Toulouse, France
May 20, 2006

The Influence of Word Detection Variability on IR Performance in Automatic Audio Indexing of Course Lectures

Renato Rispoli (1), Richard C. Rose (1), Jon Arrowood (2)

(1) Dept. of Electrical and Computer Engineering, McGill University, Montreal, QC, Canada
(2) Nexidia Inc., Atlanta, GA, USA

This paper presents a study of the influence of acoustic variability on topic spotting performance in an application involving automatic indexing of course lectures. The application involves users formulating keyword queries to an indexing system which includes phone lattice based acoustic representations of audio material, a mechanism for keyword searching of a phone lattice, and a measure for evaluating the relevance of short segments of the audio files with respect to the query. The paper describes the results of a study where keyword detection performance for query terms provided by domain experts is related to the information retrieval (IR) performance obtained using a novel information retrieval measure. IR performance obtained from text transcriptions of the audio recordings and from the audio recordings themselves are presented.

Full Paper
Presentation (.ppt)

Bibliographic reference.  Rispoli, Renato / Rose, Richard C. / Arrowood, Jon (2006): "The influence of word detection variability on IR performance in automatic audio indexing of course lectures", In SRIV-2006, 129-134.