Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Investigations on Ensemble Based Semi-Supervised Acoustic Model Training

Rong Zhang, Ziad Al Bawab, Arthur Chan, Ananlada Chotimongkol, David Huggins-Daines, Alexander I. Rudnicky

Carnegie Mellon University, Pittsburgh, PA, USA

Semi-supervised learning has been recognized as an effective way to improve acoustic model training when sufficient transcribed data are not available. Unlike most existing approaches, which use a single acoustic model and focus on how to refine it, this paper investigates the feasibility of using ensemble methods for semi-supervised acoustic model training. Two methods are investigated: one is a generalized Boosting algorithm, and the other is based on data partitions. Both methods demonstrate substantial improvement over the baseline. A relative word error rate reduction of more than 15% was observed in our experiments on a large real-world meeting recognition dataset.
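For readers unfamiliar with the metric, the relative reduction quoted above is conventionally computed as the drop in word error rate divided by the baseline word error rate. The short sketch below illustrates that arithmetic only. The function name `relative_wer_reduction` and the example figures (40.0 and 34.0) are hypothetical and are not taken from the paper.

```python
def relative_wer_reduction(baseline_wer: float, new_wer: float) -> float:
    """Return the relative WER reduction as a fraction of the baseline WER."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


# Hypothetical values for illustration only (NOT results from the paper):
# a baseline WER of 40.0% lowered to 34.0% by an improved model.
reduction = relative_wer_reduction(40.0, 34.0)
print(f"Relative WER reduction: {reduction:.1%}")
```

Under these assumed values, an absolute drop of 6 points corresponds to a 15% relative reduction, which shows why relative and absolute figures should not be confused when comparing systems.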


Bibliographic reference.  Zhang, Rong / Bawab, Ziad Al / Chan, Arthur / Chotimongkol, Ananlada / Huggins-Daines, David / Rudnicky, Alexander I. (2005): "Investigations on ensemble based semi-supervised acoustic model training", In INTERSPEECH-2005, 1677-1680.