16th Annual Conference of the International Speech Communication Association

Dresden, Germany
September 6-10, 2015

Gaussian Free Cluster Tree Construction Using Deep Neural Network

Linchen Zhu, Kevin Kilgour, Sebastian Stüker, Alex Waibel

KIT, Germany

This paper presents a Gaussian free approach to constructing the cluster tree (CT) that context dependent acoustic models (CD-AM) depend on. Over the last few years deep neural networks (DNN) have supplanted Gaussian mixture models (GMM) as the default method for acoustic modeling (AM). DNN AMs have also been successfully used to flat start context independent (CI) AMs and generate alignments on which CTs can be trained. Those approaches however still required Gaussians to build their CTs. Our proposed Gaussian free CT algorithm eliminates this requirements and allows, for the first time, the flat start training of state of the art DNN AMs without the use of Gaussian. An evaluation on the IWSLT transcription task demonstrates the effectiveness of this approach.

Full Paper

Bibliographic reference.  Zhu, Linchen / Kilgour, Kevin / Stüker, Sebastian / Waibel, Alex (2015): "Gaussian free cluster tree construction using deep neural network", In INTERSPEECH-2015, 3254-3258.