A French-Spanish Multimodal Speech Communication Corpus Incorporating Acoustic Data, Facial, Hands and Arms Gestures Information

Lucas D. Terissi, Gonzalo Sad, Mauricio Cerda, Slim Ouni, Rodrigo Galvez, Juan C. Gómez, Bernard Girau, Nancy Hitschfeld-Kahler


This paper presents a bilingual multimodal speech communication corpus that incorporates acoustic data as well as visual data related to face, hand and arm gestures during speech. The corpus comprises different speaking modalities, including scripted text speech, natural conversation and free speech, and has been compiled in two languages, French and Spanish. The experimental setups for recording the corpus, the acquisition protocols and the equipment employed are described. Statistics on the number and gender of the speakers, the number of words, the number of sentences and the duration of the recording sessions are also provided. Preliminary results from an analysis of the correlation among speech, head and hand movements during spontaneous speech are also presented, showing that acoustic prosodic features are related to head and hand gestures.


DOI: 10.21437/Interspeech.2018-2212

Cite as: Terissi, L.D., Sad, G., Cerda, M., Ouni, S., Galvez, R., Gómez, J.C., Girau, B., Hitschfeld-Kahler, N. (2018) A French-Spanish Multimodal Speech Communication Corpus Incorporating Acoustic Data, Facial, Hands and Arms Gestures Information. Proc. Interspeech 2018, 2778-2782, DOI: 10.21437/Interspeech.2018-2212.


@inproceedings{Terissi2018,
  author={Lucas D. Terissi and Gonzalo Sad and Mauricio Cerda and Slim Ouni and Rodrigo Galvez and Juan C. Gómez and Bernard Girau and Nancy Hitschfeld-Kahler},
  title={A French-Spanish Multimodal Speech Communication Corpus Incorporating Acoustic Data, Facial, Hands and Arms Gestures Information},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={2778--2782},
  doi={10.21437/Interspeech.2018-2212},
  url={http://dx.doi.org/10.21437/Interspeech.2018-2212}
}