ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Deep belief network based semantic taggers for spoken language understanding

Anoop Deoras, Ruhi Sarikaya

This paper investigates the use of deep belief networks (DBN) for semantic tagging, a sequence classification task, in spoken language understanding (SLU).We evaluate the performance of the DBN based sequence tagger on the well-studied ATIS task and compare our technique to conditional random fields (CRF), a stateof- the-art classifier for sequence classification. In conjunction with lexical and named entity features, we also use dependency parser based syntactic features and part of speech (POS) tags. Under both noisy conditions (output of automatic speech recognition system) and clean conditions (manual transcriptions), our deep belief network based sequence tagger outperforms the best CRF based system described in [1] by an absolute 2% and 1% F-measure, respectively. Upon carrying out an analysis of cases where CRF and DBN models made different predictions, we observed that when discrete features are projected onto a continuous space during neural network training, the model learns to cluster these features leading to its improved generalization capability, relative to a CRF model, especially in cases where some features are either missing or noisy.

G. Tur, D. Hakkani-Tur, L. Heck, and S. Parthasarathy, "Sentence Simplification for Spoken Language Understanding," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2011, 5628–5631.


doi: 10.21437/Interspeech.2013-623

Cite as: Deoras, A., Sarikaya, R. (2013) Deep belief network based semantic taggers for spoken language understanding. Proc. Interspeech 2013, 2713-2717, doi: 10.21437/Interspeech.2013-623

@inproceedings{deoras13_interspeech,
  author={Anoop Deoras and Ruhi Sarikaya},
  title={{Deep belief network based semantic taggers for spoken language understanding}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2713--2717},
  doi={10.21437/Interspeech.2013-623}
}