16th Annual Conference of the International Speech Communication Association

Dresden, Germany
September 6-10, 2015

Is it Time to Switch to Word Embedding and Recurrent Neural Networks for Spoken Language Understanding?

Vedran Vukotic (1), Christian Raymond (1), Guillaume Gravier (2)

(1) INSA de Rennes, France
(2) INRIA, France

Recently, word embedding representations have been investigated for slot filling in Spoken Language Understanding, along with the use of Neural Networks as classifiers. Neural Networks, especially Recurrent Neural Networks, that are specifically adapted to sequence labeling problems, have been applied successfully on the popular ATIS database. In this work, we make a comparison of this kind of models with the previously state-of-the-art Conditional Random Fields (CRF) classifier on a more challenging SLU database. We show that, despite efficient word representations used within these Neural Networks, their ability to process sequences is still significantly lower than for CRF, while also having a drawback of higher computational costs, and that the ability of CRF to model output label dependencies is crucial for SLU.

Full Paper

Bibliographic reference.  Vukotic, Vedran / Raymond, Christian / Gravier, Guillaume (2015): "Is it time to Switch to word embedding and recurrent neural networks for spoken language understanding?", In INTERSPEECH-2015, 130-134.