This paper presents a novel method for processing automatic speech recognition (ASR) lattices (and generally weighted finite state acceptors) in feed-forward artificial neural networks. It is based on the existing work focused on the text classification using convolutional neural networks (CNNs). The presented method generalizes the convolutional layer of the neural network so that it is able to process both the posterior probabilities and the lexical information contained in an ASR lattice. The convolutional layer was used in a CNN-based implementation of a hierarchical discriminative model (HDM). The method was evaluated using two semantically annotated corpora and the CNN-based HDM improves performance of a spoken language understanding module in comparison with an original HDM based on Support Vector Machines (SVM).
Bibliographic reference. Švec, Jan / Chýlek, Adam / Šmídl, Luboš (2015): "Hierarchical discriminative model for spoken language understanding based on convolutional neural network", In INTERSPEECH-2015, 1864-1868.