8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Recognition of Three Simultaneous Utterance of Speech by Four-Line Directivity Microphone Mounted on Head of Robot

Naoya Mochiki, Tetsunori Kobayashi, Toshiyuki Sekiya, Tetsuji Ogawa

Waseda University, Japan

A sound source separation method using four-line directivity microphones mounted on the head of a robot is proposed and applied to speech recognition in the presence of two interfering speech sources. Sound source separation methods using microphones mounted on robot heads have generally relied on precise head-related transfer functions (HRTFs). We propose a robust sound source separation method that does not require estimating a precise HRTF. Our method takes advantage of the sound pressure difference that arises because the robot head acts as a sound barrier. This difference is enhanced for the target speech by three layers of signal processing: two-line SAFIA, two-line spectral subtraction, and their integration. Experimental results on recognition of three simultaneous utterances with a 20K-word vocabulary show that the proposed method is effective, achieving a 71% error reduction.
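
The three processing layers named in the abstract can be illustrated with a minimal sketch on magnitude spectra of a single frame. This is not the authors' implementation: the abstract gives no formulas, so the function names, the `alpha` and `floor` parameters, and in particular the bin-wise minimum used for the integration step are assumptions made only for illustration. The band-selection function follows the general idea behind SAFIA-style processing (keep a frequency bin only in the channel where it is dominant), and the subtraction function is plain magnitude spectral subtraction.

```python
def band_select(target_mag, other_mag):
    """Keep a frequency bin of the target channel only where it is
    strictly louder than the same bin in the other channel."""
    return [t if t > o else 0.0 for t, o in zip(target_mag, other_mag)]


def spectral_subtract(target_mag, interference_mag, alpha=1.0, floor=0.0):
    """Subtract a scaled interference magnitude from the target magnitude,
    clamping each bin to floor * target (alpha, floor are assumed knobs)."""
    return [max(t - alpha * i, floor * t)
            for t, i in zip(target_mag, interference_mag)]


def integrate(selected, subtracted):
    """Hypothetical integration rule: bin-wise minimum of the two outputs.
    The abstract does not state how the two layers are combined."""
    return [min(a, b) for a, b in zip(selected, subtracted)]


# Toy magnitude spectra (four bins) for a channel facing the target
# and a channel facing an interfering talker.
front = [1.0, 0.25, 0.5, 0.5]
side = [0.25, 1.0, 0.5, 0.125]

selected = band_select(front, side)
subtracted = spectral_subtract(front, side)
enhanced = integrate(selected, subtracted)
```

In this toy frame, bins where the side channel is at least as loud as the front channel are suppressed by both layers, which mirrors how the head acting as a sound barrier makes level differences between channels usable for separation.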

Bibliographic reference. Mochiki, Naoya / Kobayashi, Tetsunori / Sekiya, Toshiyuki / Ogawa, Tetsuji (2004): "Recognition of three simultaneous utterance of speech by four-line directivity microphone mounted on head of robot", in INTERSPEECH-2004, 821-824.