ISCA Archive SpeechProsody 2010
ISCA Archive SpeechProsody 2010

Cross-cultural perception of Vietnamese audio-visual prosodic attitudes

Dang-Khoa Mac, Véronique Aubergé, Albert Rilliard, Eric Castelli

Prosodic attitudes (social affects) are highly linked to the language through the culture, and are a main part of face to face interaction. Therefore, for description and modeling, as well as for applications like translation, language learning or synthesis, a cross-cultural approach is relevant. This paper presents a cross-perception of Audio-Visual prosodic attitudes in Vietnamese, an under-resourced tonal language. Based on an audio-visual corpus of 16 attitudes, perception experiments were carried out with Vietnamese and French participants: firstly, to understand the contribution of audio and visual modalities to affective communication; secondly, to perceptually measure how the native and non-native listeners recognize and confuse the Vietnamese attitudes. The results reveal cultural specificities and cross-cultural common attitudes in Vietnamese.

Index Terms: Audio-visual corpus, Prosodic social affects, Cross-cultural perception, Vietnamese

Cite as: Mac, D.-K., Aubergé, V., Rilliard, A., Castelli, E. (2010) Cross-cultural perception of Vietnamese audio-visual prosodic attitudes. Proc. Speech Prosody 2010, paper 356

  author={Dang-Khoa Mac and Véronique Aubergé and Albert Rilliard and Eric Castelli},
  title={{Cross-cultural perception of Vietnamese audio-visual prosodic attitudes}},
  booktitle={Proc. Speech Prosody 2010},
  pages={paper 356}