15th Annual Conference of the International Speech Communication Association

September 14-18, 2014

A Comparison of Multiple Methods for Rescoring Keyword Search Lists for Low Resource Languages

Victor Soto (1), Lidia Mangu (2), Andrew Rosenberg (3), Julia Hirschberg (1)

(1) Columbia University, USA
(2) IBM T.J. Watson Research Center, USA
(3) CUNY Graduate Center, USA

We review the performance of a new two-stage cascaded machine learning approach for rescoring keyword search output for low resource languages. In the first stage Confusion Networks (CNs) are rescored for improved Automatic Speech Recognition (ASR) by reranking the arcs of each confusion bin. In the second stage we generate keyword search hypotheses from the rescored ASR output and rescore them using logistic regression classifiers to detect true hits and false alarms. We compare the performance of our system with state of the art rescoring techniques, including probability of false alarm normalization, exponential normalization, rank-normalized posterior scores and sum-to-one normalization and show promising results. Experimental validation is performed using the Term Weighted Value (TWV) metric on four corpora from the IARPA-Babel program for keyword search on low resource languages, including Assamese, Bengali, Lao and Zulu.

Full Paper

Bibliographic reference.  Soto, Victor / Mangu, Lidia / Rosenberg, Andrew / Hirschberg, Julia (2014): "A comparison of multiple methods for rescoring keyword search lists for low resource languages", In INTERSPEECH-2014, 2464-2468.