ISCA Archive SpeechProsody 2008
ISCA Archive SpeechProsody 2008

Recognition of Japanese attitudes in audio-visual speech

Takaaki Shochi, Donna Erickson, Albert Rilliard, Véronique Aubergé, Jean-Claude Martin

The aim of the present work is to investigate how Japanese listeners recognize 12 audio-visual prosodic attitudes of Japanese. Significant influences of the two speakers and three modalities were observed. Generally the audio-visual condition showed the best recognition score, and interesting behavior for audio and visual modality was observed. Attitudes were regrouped into 3 higher-level perceptual categories for the first speaker: polite expressions, the attitudes of "query", and the expressions of imposition of one’s own opinion. The attitude of kyoshuku and surprise are particularly well recognized by visual information.


Cite as: Shochi, T., Erickson, D., Rilliard, A., Aubergé, V., Martin, J.-C. (2008) Recognition of Japanese attitudes in audio-visual speech. Proc. Speech Prosody 2008, 689-692

@inproceedings{shochi08_speechprosody,
  author={Takaaki Shochi and Donna Erickson and Albert Rilliard and Véronique Aubergé and Jean-Claude Martin},
  title={{Recognition of Japanese attitudes in audio-visual speech}},
  year=2008,
  booktitle={Proc. Speech Prosody 2008},
  pages={689--692}
}