12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31. 2011

Approximate Inference for Domain Detection in Spoken Language Understanding

Asli Celikyilmaz, Dilek Hakkani-Tür, Gokhan Tur

Microsoft Speech Labs, USA

This paper presents a semi-latent topic model for semantic domain detection in spoken language understanding systems. We use labeled utterance information to capture latent topics, which directly correspond to semantic domains. Additionally, we introduce an 'informative prior' for Bayesian inference that can simultaneously segment utterances of known domains into classes and divide them from out-of-domain utterances. We show that our model generalizes well on the task of classifying spoken language utterances and compare its results to those of an unsupervised topic model, which does not use labeled information.

Full Paper

Bibliographic reference.  Celikyilmaz, Asli / Hakkani-Tür, Dilek / Tur, Gokhan (2011): "Approximate inference for domain detection in spoken language understanding", In INTERSPEECH-2011, 713-716.