14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Recurrent Neural Networks for Language Understanding

Kaisheng Yao (1), Geoffrey Zweig (2), Mei-Yuh Hwang (1), Yangyang Shi (3), Dong Yu (2)

(1) Microsoft, China
(2) Microsoft Research, USA
(3) Technische Universiteit Delft, The Netherlands

Recurrent Neural Network Language Models (RNN-LMs) have recently shown exceptional performance across a variety of applications. In this paper, we modify the architecture to perform Language Understanding, and advance the state-of-the-art for the widely used ATIS dataset. The core of our approach is to take words as input as in a standard RNN-LM, and then to predict slot labels rather than words on the output side. We present several variations that differ in the amount of word context that is used on the input side, and in the use of non-lexical features. Remarkably, our simplest model produces state-of-the-art results, and we advance state-of-the-art through the use of bag-of-words, word embedding, named-entity, syntactic, and word-class features. Analysis indicates that the superior performance is attributable to the task-specific word representations learned by the RNN.

Full Paper

Bibliographic reference.  Yao, Kaisheng / Zweig, Geoffrey / Hwang, Mei-Yuh / Shi, Yangyang / Yu, Dong (2013): "Recurrent neural networks for language understanding", In INTERSPEECH-2013, 2524-2528.