ISCA Archive Interspeech 2017

RNN-LDA Clustering for Feature Based DNN Adaptation

Xurong Xie, Xunying Liu, Tan Lee, Lan Wang

Model based deep neural network (DNN) adaptation approaches often require multi-pass decoding at test time. Input feature based DNN adaptation, for example based on latent Dirichlet allocation (LDA) clustering, provides a more efficient alternative. In conventional LDA clustering, the transition and correlation between neighboring clusters are ignored. To address this issue, this paper proposes a recurrent neural network (RNN) based clustering scheme that learns both the standard LDA cluster labels and their natural correlation over time. In addition to directly using the resulting RNN-LDA features as input features during DNN adaptation, a range of techniques were investigated to condition the DNN hidden layer parameters or activation outputs on the RNN-LDA features. On a DARPA Gale Mandarin Chinese broadcast speech transcription task, the DNN system adapted with the proposed RNN-LDA cluster features outperformed both the baseline unadapted DNN system and the DNN system adapted with conventional LDA features by 8% relative on the most difficult Phoenix TV subset. Consistent improvements were also obtained after further combination with model based adaptation approaches.
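
As a rough illustration of input feature based adaptation in general, the sketch below appends a cluster feature vector to every acoustic frame before the frames are passed to a DNN. It is not the authors' implementation: the function name `append_cluster_features`, the 40-dimensional frames, the 3-dimensional cluster vector, and the assumption that one fixed-length cluster vector is available per utterance are all hypothetical choices made for the example.

```python
import numpy as np

def append_cluster_features(frames, cluster_posteriors):
    """Tile one utterance-level cluster vector onto every acoustic frame.

    frames: array of shape (num_frames, feat_dim), acoustic features.
    cluster_posteriors: array of shape (num_clusters,), assumed to be a
        single cluster feature vector for the whole utterance (an
        assumption of this sketch, not a detail taken from the abstract).
    """
    frames = np.asarray(frames, dtype=float)
    cluster_posteriors = np.asarray(cluster_posteriors, dtype=float)
    # Repeat the cluster vector once per frame: (num_frames, num_clusters).
    tiled = np.tile(cluster_posteriors, (frames.shape[0], 1))
    # Concatenate along the feature axis to form the adapted DNN input.
    return np.concatenate([frames, tiled], axis=1)

# Hypothetical sizes: 5 frames of 40-dimensional features, 3 clusters.
frames = np.zeros((5, 40))
posteriors = np.array([0.7, 0.2, 0.1])
adapted_input = append_cluster_features(frames, posteriors)
print(adapted_input.shape)
```

Because the auxiliary vector is computed once and simply concatenated to the input, this style of adaptation needs no additional decoding pass, which is the efficiency argument the abstract makes against model based approaches.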

doi: 10.21437/Interspeech.2017-368

Cite as: Xie, X., Liu, X., Lee, T., Wang, L. (2017) RNN-LDA Clustering for Feature Based DNN Adaptation. Proc. Interspeech 2017, 2396-2400, doi: 10.21437/Interspeech.2017-368

  author={Xurong Xie and Xunying Liu and Tan Lee and Lan Wang},
  title={{RNN-LDA Clustering for Feature Based DNN Adaptation}},
  booktitle={Proc. Interspeech 2017},