11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Improved Spoken Term Detection by Feature Space Pseudo-Relevance Feedback

Chia-ping Chen, Hung-yi Lee, Ching-feng Yeh, Lin-shan Lee

National Taiwan University, Taiwan

In this paper, we propose an improved approach for spoken term detection using pseudo-relevance feedback. To remedy the problem of unmatched acoustic models with respect to spoken utterances produced under different acoustic conditions, which may give relatively poor recognition output, we integrate the relevance scores derived from the lattices with the DTW distances derived from the feature space of MFCC parameters or phonetic posteriorgrams. These DTW distances are evaluated for a carefully selected set of pseudo-relevant utterances, which obtained from the first-pass returned list given by the search engine. The utterances on the first-pass returned list are then reranked accordingly and finally shown to the user. Very encouraging, performance improvements were obtained in the preliminary experiments, especially when the acoustic models are poorly matched to the spoken utterances.

Full Paper

Bibliographic reference.  Chen, Chia-ping / Lee, Hung-yi / Yeh, Ching-feng / Lee, Lin-shan (2010): "Improved spoken term detection by feature space pseudo-relevance feedback", In INTERSPEECH-2010, 1672-1675.