In the last decades, encoder-decoders or autoencoders (AE) have received a great interest from researchers due to their capability to construct robust representations of documents in a low dimensional subspace. Nonetheless, autoencoders reveal little in way of spoken document internal structure by only considering words or topics contained in the document as an isolate basic element, and tend to overfit with small corpus of documents. Therefore, Quaternion Multi-layer Perceptrons (QMLP) have been introduced to capture such internal latent dependencies, whereas denoising autoencoders (DAE) are composed with different stochastic noises to better process small set of documents. This paper presents a novel autoencoder based on both hitherto-proposed DAE (to manage small corpus) and the QMLP (to consider internal latent structures) called “Quaternion denoising encoder-decoder” (QDAE). Moreover, the paper defines an original angular Gaussian noise adapted to the specificity of hyper-complex algebra. The experiments, conduced on a theme identification task of spoken dialogues from the DECODA framework, show that the QDAE obtains the promising gains of 3% and 1.5% compared to the standard real valued denoising autoencoder and the QMLP respectively.
Cite as: Parcollet, T., Morchid, M., Linarès, G. (2017) Quaternion Denoising Encoder-Decoder for Theme Identification of Telephone Conversations. Proc. Interspeech 2017, 3325-3328, doi: 10.21437/Interspeech.2017-1029
@inproceedings{parcollet17_interspeech, author={Titouan Parcollet and Mohamed Morchid and Georges Linarès}, title={{Quaternion Denoising Encoder-Decoder for Theme Identification of Telephone Conversations}}, year=2017, booktitle={Proc. Interspeech 2017}, pages={3325--3328}, doi={10.21437/Interspeech.2017-1029} }