10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Nonstationary Latent Dirichlet Allocation for Speech Recognition

Chuang-Hua Chueh, Jen-Tzung Chien

National Cheng Kung University, Taiwan

Latent Dirichlet allocation (LDA) has been successful for document modeling. LDA extracts the latent topics shared across documents and assumes that all words in a document are generated from the same topic distribution. In real-world documents, however, word usage varies from paragraph to paragraph and is accompanied by different writing styles. This study extends LDA to cope with the variations of topic information within a document. We build the nonstationary LDA (NLDA) by incorporating a Markov chain that detects the stylistic segments in a document, where each segment corresponds to a particular style of composition. NLDA can thus exploit the topic information across documents as well as the word variations within a document. We accordingly establish a Viterbi-based variational Bayesian procedure. A language model adaptation scheme using NLDA is developed for speech recognition. Experimental results show that NLDA improves over LDA in terms of perplexity and word error rate.
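The segment-detection idea can be illustrated with a minimal sketch of Viterbi decoding over a left-to-right Markov chain. This is not the authors' procedure: the abstract does not specify the model equations, so the sketch assumes that each state (a hypothetical "style") has a fixed, known word distribution, whereas in NLDA these would depend on topic mixtures estimated by variational inference. The function name `viterbi_segments` and all probabilities below are invented for illustration.

```python
import math

NEG_INF = float("-inf")


def viterbi_segments(word_ids, log_emit, log_trans, log_init):
    """Return the most likely state (segment) label for every word.

    word_ids  : list of integer word indices
    log_emit  : log_emit[s][w]  = log p(word w | state s)       (assumed known)
    log_trans : log_trans[r][s] = log p(state s | prev state r)
    log_init  : log_init[s]     = log p(first state is s)
    """
    n_states = len(log_init)
    # score[s] holds the best log probability of any path ending in state s.
    score = [log_init[s] + log_emit[s][word_ids[0]] for s in range(n_states)]
    backptr = []
    for w in word_ids[1:]:
        prev = score
        score, step_ptr = [], []
        for s in range(n_states):
            # Best predecessor state for landing in state s at this word.
            best_r = max(range(n_states), key=lambda r: prev[r] + log_trans[r][s])
            step_ptr.append(best_r)
            score.append(prev[best_r] + log_trans[best_r][s] + log_emit[s][w])
        backptr.append(step_ptr)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for step_ptr in reversed(backptr):
        state = step_ptr[state]
        path.append(state)
    path.reverse()
    return path


# Toy example (all numbers are illustrative assumptions):
# state 0 prefers word 0, state 1 prefers word 1.
log_emit = [[math.log(0.9), math.log(0.1)],
            [math.log(0.1), math.log(0.9)]]
# Left-to-right chain: state 1 can never return to state 0.
log_trans = [[math.log(0.8), math.log(0.2)],
             [NEG_INF, 0.0]]
log_init = [0.0, NEG_INF]  # always start in state 0

path = viterbi_segments([0, 0, 0, 1, 1, 1], log_emit, log_trans, log_init)
print(path)
```

In this toy case the decoded path changes state at the fourth word, which marks the boundary between two segments. Working in the log domain avoids numerical underflow on long documents, and the left-to-right transition matrix forces the segments to be contiguous.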

Bibliographic reference. Chueh, Chuang-Hua / Chien, Jen-Tzung (2009): "Nonstationary latent Dirichlet allocation for speech recognition", in INTERSPEECH-2009, 372-375.