15th Annual Conference of the International Speech Communication Association

September 14-18, 2014

Semi-Supervised Training for Bottle-Neck Feature based DNN-HMM Hybrid Systems

Haihua Xu (1), Hang Su (2), Eng Siong Chng (1), Haizhou Li (3)

(1) Nanyang Technological University, Singapore
(3) A*STAR, Singapore

In this paper, we investigate semi-supervised training (SST) method in various state-of-the-art acoustic modeling techniques, using bottle-neck and corresponding tandem features. These techniques include subspace GMM, tanh-neuron deep neural network (DNN), and a generalized soft-maxout (p-norm) DNN. We demonstrate that SST may lead up to 2% Word Error Rate (WER) reduction using all these techniques in each case, and the best one comes from tandem feature based p-norm DNN system. In addition to recognition performance, effectiveness of the SST on keyword search performance is also investigated. Results on Actual Term Weighted Value (ATWV) are reported, with an analysis on lattice density. It is shown that SST may not necessarily increase ATWV due to the shrink of lattices size.

Full Paper

Bibliographic reference.  Xu, Haihua / Su, Hang / Chng, Eng Siong / Li, Haizhou (2014): "Semi-supervised training for bottle-neck feature based DNN-HMM hybrid systems", In INTERSPEECH-2014, 2078-2082.