15th Annual Conference of the International Speech Communication Association

September 14-18, 2014

Word-Based Probabilistic Phonetic Retrieval for Low-Resource Spoken Term Detection

Di Xu, Florian Metze

Carnegie Mellon University, USA

Two problems make Spoken Term Detection (STD) particularly challenging under low-resource conditions: the low quality of speech recognition hypotheses, and a high number of out-of-vocabulary (OOV) words. In this paper, we propose an intuitive way to handle OOV terms for STD on word-based Confusion Networks using phonetic similarities, and generalize it into a probabilistic and vocabulary-independent retrieval framework. We then reflect on how several heuristics and Machine Learning based methods can be incorporated into this framework to improve retrieval performance. We present experimental results on several low-resource languages from IARPA's Babel program, such as Assamese, Bengali, Haitian, and Lao.

Full Paper

Bibliographic reference.  Xu, Di / Metze, Florian (2014): "Word-based probabilistic phonetic retrieval for low-resource spoken term detection", In INTERSPEECH-2014, 2774-2778.