Spoken Language Understanding in a Latent Topic-Based Subspace

Mohamed Morchid, Mohamed Bouaziz, Waad Ben Kheder, Killian Janod, Pierre-Michel Bousquet, Richard Dufour, Georges Linarès

Performance of spoken language understanding applications declines when spoken documents are automatically transcribed in noisy conditions due to high Word Error Rates (WER). To improve the robustness to transcription errors, recent solutions propose to map these automatic transcriptions in a latent space. These studies have proposed to compare classical topic-based representations such as Latent Dirichlet Allocation (LDA), supervised LDA and author-topic (AT) models. An original compact representation, called c-vector, has recently been introduced to walk around the tricky choice of the number of latent topics in these topic-based representations. Moreover, c-vectors allow to increase the robustness of document classification with respect to transcription errors by compacting different LDA representations of a same speech document in a reduced space and then compensate most of the noise of the document representation. The main drawback of this method is the number of sub-tasks needed to build the c-vector space. This paper proposes to both improve this compact representation (c-vector) of spoken documents and to reduce the number of needed sub-tasks, using an original framework in a robust low dimensional space of features from a set of AT models called “Latent Topic-based Subspace” (LTS). In comparison to LDA, the AT model considers not only the dialogue content (words), but also the class related to the document. Experiments are conducted on the DECODA corpus containing speech conversations from the call-center of the RATP Paris transportation company. Results show that the original LTS representation outperforms the best previous compact representation (c-vector), with a substantial gain of more than 2.5% in terms of correctly labeled conversations.

DOI: 10.21437/Interspeech.2016-50

Cite as

Morchid, M., Bouaziz, M., Kheder, W.B., Janod, K., Bousquet, P., Dufour, R., Linarès, G. (2016) Spoken Language Understanding in a Latent Topic-Based Subspace. Proc. Interspeech 2016, 710-714.

author={Mohamed Morchid and Mohamed Bouaziz and Waad Ben Kheder and Killian Janod and Pierre-Michel Bousquet and Richard Dufour and Georges Linarès},
title={Spoken Language Understanding in a Latent Topic-Based Subspace},
booktitle={Interspeech 2016},