Cultural Differences in Pattern Matching: Multisensory Recognition of Socio-affective Prosody

Takaaki Shochi, Jean-Luc Rouas, Marine Guerry, Donna Erickson


This study focuses on the cross-cultural differences in perception of audio visual prosodic recordings of Japanese social affects. The study compares cultural differences of perceptual patterns of 21 Japanese subjects with 20 French subjects who have no knowledge of Japanese language or Japanese social affects. The test material is a semantically affectively neutral utterance expressed in 9 various social affects by 2 Japanese speakers (one male, one female) who were chosen as best performers in our previous recognition experiment. The task was to create a specific audio-visual affect by choosing one video stimuli among 9 choices and one audio stimuli, again among 9 choices. The participants could preview each audio and video stimuli individually and also the combination of chosen stimuli. The results reveal that native subjects can correctly combine auditory and visually expressed social affects, showing some confusion inside semantic categories. Different matching patterns are observed for non-native subjects especially for a type of cultural-specific politeness.


 DOI: 10.21437/Interspeech.2018-1795

Cite as: Shochi, T., Rouas, J., Guerry, M., Erickson, D. (2018) Cultural Differences in Pattern Matching: Multisensory Recognition of Socio-affective Prosody. Proc. Interspeech 2018, 2201-2205, DOI: 10.21437/Interspeech.2018-1795.


@inproceedings{Shochi2018,
  author={Takaaki Shochi and Jean-Luc Rouas and Marine Guerry and Donna Erickson},
  title={Cultural Differences in Pattern Matching: Multisensory Recognition of Socio-affective Prosody},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={2201--2205},
  doi={10.21437/Interspeech.2018-1795},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1795}
}