11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

VAD-Measure-Embedded Decoder with Online Model Adaptation

Tasuku Oonishi (1), Koji Iwano (2), Sadaoki Furui (1)

(1) Tokyo Institute of Technology, Japan
(2) Tokyo City University, Japan

We previously proposed a decoding method that weights hypothesis scores by voice activity detection (VAD) measures. The method uses two GMMs to obtain confidence measures, and achieving good search performance requires adapting these GMMs to the environmental noise. We describe a new unsupervised online GMM adaptation method based on MAP estimation. Its robustness is further improved by weighting the GMM parameter updates according to the confidence measure of the adaptation data. We also describe an approach that accelerates adaptation by caching the statistics used to adapt the GMMs. Experimental results show that our adaptive decoding method significantly improves word accuracy in a noisy environment with only a minor increase in computational cost.
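
The abstract does not give the update equations, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a one-dimensional GMM, adapts the means only, and uses the textbook MAP mean update (prior mean weighted by a relevance factor `tau`). Treating the confidence measure as a multiplier on the component posteriors is an assumption made for illustration, and the names `OnlineMapGmm`, `tau`, `occ`, and `first` are hypothetical.

```python
import math


class OnlineMapGmm:
    """1-D GMM whose means are MAP-adapted from cached, confidence-weighted statistics."""

    def __init__(self, weights, means, variances, tau=10.0):
        self.weights = list(weights)
        self.prior_means = list(means)  # prior means stay fixed during adaptation
        self.means = list(means)        # adapted means, updated online
        self.variances = list(variances)
        self.tau = tau                  # relevance factor: how strongly the prior is trusted
        # Cached sufficient statistics, so each update costs O(number of components)
        # and past frames never need to be revisited.
        self.occ = [0.0] * len(self.means)    # sum of confidence * posterior
        self.first = [0.0] * len(self.means)  # sum of confidence * posterior * x

    def posteriors(self, x):
        """Component posteriors for observation x under the current model."""
        likes = [
            w * math.exp(-0.5 * (x - m) ** 2 / v) / math.sqrt(2.0 * math.pi * v)
            for w, m, v in zip(self.weights, self.means, self.variances)
        ]
        total = sum(likes)
        if total == 0.0:
            # Numerical underflow for a far-away observation: fall back to uniform.
            return [1.0 / len(likes)] * len(likes)
        return [like / total for like in likes]

    def update(self, x, confidence):
        """Fold one observation into the cache and refresh the MAP means.

        `confidence` in [0, 1] scales how much this observation contributes
        (an assumed reading of confidence-weighted updating).
        """
        for k, gamma in enumerate(self.posteriors(x)):
            self.occ[k] += confidence * gamma
            self.first[k] += confidence * gamma * x
            # MAP mean: interpolate between the prior mean and the data mean.
            self.means[k] = (self.tau * self.prior_means[k] + self.first[k]) / (
                self.tau + self.occ[k]
            )


if __name__ == "__main__":
    gmm = OnlineMapGmm(weights=[0.5, 0.5], means=[0.0, 5.0], variances=[1.0, 1.0])
    for frame, conf in [(0.8, 0.9), (1.1, 0.7), (4.2, 0.2)]:
        gmm.update(frame, conf)
    print(gmm.means)
```

In this sketch an observation with confidence 0 leaves the model unchanged, and because only the running sums `occ` and `first` are stored, the cost of each update does not depend on how many frames have been seen.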


Bibliographic reference. Oonishi, Tasuku / Iwano, Koji / Furui, Sadaoki (2010): "VAD-measure-embedded decoder with online model adaptation", In INTERSPEECH-2010, 3122-3125.