EUROSPEECH 2003 - INTERSPEECH 2003
We propose a new style of practical input interface for the recognition of non-audible murmur (NAM), i.e., for the recognition of inaudible speech produced without vibration of the vocal folds. We developed a microphone attachment, which adheres to the skin, applying the principle of a medical stethoscope, found the ideal position for sampling flesh-conducted NAM sound vibration and retrained an acoustic model with NAM samples. Then using the Julius Japanese Dictation Toolkit, we tested the possibilities for practical use of this method in place of an external microphone for analyzing air-conducted voice sound. Additionally we propose laryngeal elevation index (LEI), a new index of prosody, which can show the prosody of NAM without F0, using simple processing of images from medical ultrasonography. We realized and defined NAM never used for input or communication and propose that we should make use of it for the interface of human-human and human-cybernetic machines.
Bibliographic reference. Nakajima, Yoshitaka / Kashioka, Hideki / Shikano, Kiyohiro / Campbell, Nick (2003): "Non-audible murmur recognition", In EUROSPEECH-2003, 2601-2604.