9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Applications of Virtual-Evidence Based Speech Recognizer Training

Amarnag Subramanya, Jeff A. Bilmes

University of Washington, USA

We present two applications of our previously proposed virtual-evidence (VE) based speech recognizer training algorithm [1, 2]. The first is two-pass training, in which segmentations obtained during the first pass are used as VE to train the subsequent pass. We use the TIMIT phone recognition and SVitchboard continuous speech recognition tasks to demonstrate the benefits of VE-based training in two-pass systems. The second application uses functions that incorporate prior domain knowledge to generate VE-scores. For TIMIT phone recognition, we show that generating VE-scores with the proposed function yields a relative error rate reduction of about 6% over the baseline.


Bibliographic reference. Subramanya, Amarnag / Bilmes, Jeff A. (2008): "Applications of virtual-evidence based speech recognizer training", In INTERSPEECH-2008, 2562-2565.