8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Structural Bayesian Language Modeling and Adaptation

Sibel Yaman (1), Jen-Tzung Chien (2), Chin-Hui Lee (1)

(1) Georgia Institute of Technology, USA
(2) National Cheng Kung University, Taiwan

We propose a language modeling and adaptation framework using Bayesian structural maximum a posteriori (SMAP) principle, in which each n-gram event is embedded in a branch of a tree structure. The nodes in the first layer of this tree structure represent the unigrams, and those in the second layer represent the bigrams, and so on. Each node in the tree structure has an associated hyper-parameter representing the information about the prior distribution, and a count representing the number of times the word sequence occurs in the domain-specific data. In general, the hyper-parameters depend on the observation frequency of not only the node event but also its parent node of lower order n-gram event. Our automatic speech recognition experiments using the Wall Street Journal corpus verify that the proposed SMAP language model adaptation achieves a 5.6% relative improvement over maximum likelihood language models obtained with the same training and adaptation data sets.

Full Paper

Bibliographic reference.  Yaman, Sibel / Chien, Jen-Tzung / Lee, Chin-Hui (2007): "Structural Bayesian language modeling and adaptation", In INTERSPEECH-2007, 2365-2368.