INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Multimodal Expression for Humanoid Robots by Integration of Human Speech Mimicking and Facial Color

Tokitomo Ariyoshi, Kazuhiro Nakadai, Hiroshi Tsujino

Honda Research Institute Japan, Co., Ltd., Japan

Multimodal expression is essential for humanoid robots to communicate with people naturally and intelligibly. This paper describes multimodal expression for humanoid robots by mimicking human speech with the ability of expression through "facial colors". Currently the robot is able to express joy (by turning yellow in the face), anger (red), sadness (blue), and relaxation (green). These colors have been selected according to color psychology. The human speech mimicking is based on prosody extraction of pitch, loudness and temporal information with speech synthesis based on the extracted prosody. The multimodal expression system implemented on Honda ASIMO shows that facial colors improve affective speech recognition by over 15%. In addition, qualitative observations that use speech and facial color with conflicting affective meanings producing complex affection have been reported.

Full Paper

Bibliographic reference.  Ariyoshi, Tokitomo / Nakadai, Kazuhiro / Tsujino, Hiroshi (2004): "Multimodal expression for humanoid robots by integration of human speech mimicking and facial color", In INTERSPEECH-2004, 2305-2308.