15th Annual Conference of the International Speech Communication Association

September 14-18, 2014

Graph-Based Re-Ranking Using Acoustic Feature Similarity Between Search Results for Spoken Term Detection on Low-Resource Languages

Hung-yi Lee, Yu Zhang, Ekapol Chuangsuwanich, James R. Glass


Acoustic feature similarity between search results has been shown to be very helpful for the task of spoken term detection (STD). A graph-based re-ranking approach for STD has been proposed based on the concept that search results, which are acoustically similar to other results with higher confidence scores, should have higher scores themselves. In this approach, the similarity between all search results of a given term are considered as a graph, and the confidence scores of the search results propagate through this graph. Since this approach can improve STD results without any additional labelled data, it is especially suitable for STD on languages with limited amounts of annotated data. However, its performance has not been widely studied on benchmark corpora. In this paper, we investigate the effectiveness of the graph-based re-ranking approach on limited language data from the IARPA Babel program. Experiments on the low-resource languages, Assamese, Bengali and Lao, show that graph-based re-ranking improves STD systems using fuzzy matching, and lattices based on different kinds of units including words, subwords, and hybrids.

Full Paper

Bibliographic reference.  Lee, Hung-yi / Zhang, Yu / Chuangsuwanich, Ekapol / Glass, James R. (2014): "Graph-based re-ranking using acoustic feature similarity between search results for spoken term detection on low-resource languages", In INTERSPEECH-2014, 2479-2483.