A Saliency-Based Attention LSTM Model for Cognitive Load Classification from Speech

Ascensión Gallardo-Antolín, Juan Manuel Montero


Cognitive Load (CL) refers to the amount of mental demand that a given task imposes on an individual’s cognitive system and it can affect his/her productivity in very high load situations. In this paper, we propose an automatic system capable of classifying the CL level of a speaker by analyzing his/her voice. Our research on this topic goes into two main directions. In the first one, we focus on the use of Long Short-Term Memory (LSTM) networks with different weighted pooling strategies for CL level classification. In the second contribution, for overcoming the need of a large amount of training data, we propose a novel attention mechanism that uses the Kalinli’s auditory saliency model. Experiments show that our proposal outperforms significantly both, a baseline system based on Support Vector Machines (SVM) and a LSTM-based system with logistic regression attention model.


 DOI: 10.21437/Interspeech.2019-1603

Cite as: Gallardo-Antolín, A., Montero, J.M. (2019) A Saliency-Based Attention LSTM Model for Cognitive Load Classification from Speech. Proc. Interspeech 2019, 216-220, DOI: 10.21437/Interspeech.2019-1603.


@inproceedings{Gallardo-Antolín2019,
  author={Ascensión Gallardo-Antolín and Juan Manuel Montero},
  title={{A Saliency-Based Attention LSTM Model for Cognitive Load Classification from Speech}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={216--220},
  doi={10.21437/Interspeech.2019-1603},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1603}
}