Attention-Based Convolutional Neural Networks for Sentence Classification

Zhiwei Zhao, Youzheng Wu


Sentence classification is one of the foundational tasks in spoken language understanding (SLU) and natural language processing (NLP). In this paper we propose a novel convolutional neural network (CNN) with attention mechanism to improve the performance of sentence classification. In traditional CNN, it is not easy to encode long term contextual information and correlation between non-consecutive words effectively. In contrast, our attention-based CNN is able to capture these kinds of information for each word without any external features. We conducted experiments on various public and in-house datasets. The experimental results demonstrate that our proposed model significantly outperforms the traditional CNN model and achieves competitive performance with the ones that exploit rich syntactic features.


DOI: 10.21437/Interspeech.2016-354

Cite as

Zhao, Z., Wu, Y. (2016) Attention-Based Convolutional Neural Networks for Sentence Classification. Proc. Interspeech 2016, 705-709.

Bibtex
@inproceedings{Zhao+2016,
author={Zhiwei Zhao and Youzheng Wu},
title={Attention-Based Convolutional Neural Networks for Sentence Classification},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-354},
url={http://dx.doi.org/10.21437/Interspeech.2016-354},
pages={705--709}
}