ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Nonstationary latent Dirichlet allocation for speech recognition

Chuang-Hua Chueh, Jen-Tzung Chien

Latent Dirichlet allocation (LDA) has been successful for document modeling. LDA extracts the latent topics across documents. Words in a document are generated by the same topic distribution. However, in real-world documents, the usage of words in different paragraphs is varied and accompanied with different writing styles. This study extends the LDA and copes with the variations of topic information within a document. We build the nonstationary LDA (NLDA) by incorporating a Markov chain which is used to detect the stylistic segments in a document. Each segment corresponds to a particular style in composition of a document. This NLDA can exploit the topic information between documents as well as the word variations within a document. We accordingly establish a Viterbi-based variational Bayesian procedure. A language model adaptation scheme using NLDA is developed for speech recognition. Experimental results show improvement of NLDA over LDA in terms of perplexity and word error rate.


doi: 10.21437/Interspeech.2009-118

Cite as: Chueh, C.-H., Chien, J.-T. (2009) Nonstationary latent Dirichlet allocation for speech recognition. Proc. Interspeech 2009, 372-375, doi: 10.21437/Interspeech.2009-118

@inproceedings{chueh09_interspeech,
  author={Chuang-Hua Chueh and Jen-Tzung Chien},
  title={{Nonstationary latent Dirichlet allocation for speech recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={372--375},
  doi={10.21437/Interspeech.2009-118}
}