9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Preliminary Evaluation of Speech/Sound Recognition for Telemedicine Application in a Real Environment

Michel Vacher (1), Anthony Fleury (2), Jean-François Serignat (1), Norbert Noury (2), Hubert Glasson (1)

(1) LIG, France; (2) TIMC-IMAG, France

Improvements in medicine increase life expectancy and the number of elderly persons, but the institutions able to welcome them are not sufficient. A lot of projects work on ways allowing elderly persons to stay at home. This article describes the implementation of a sound classification and speech recognition system equipping a real flat. This system has been evaluated in uncontrolled conditions for distinguishing normal sentences from distress ones; these sentences are uttered by heterogeneous speakers. The detected signals are first classified as sound and speech. The sounds are clustered in eight classes (object fall, doors clap, phone ringing, steps, dishes, doors lock, screams and glass breaking). As for speech signals, an input utterance (in French) is recognized and a subsequent process classifies it in normal or distress, by analysing the presence of distress key words. In the same way, some sound classes are related to a possible distress situation. An experimental protocol was defined and tested in real conditions inside the flat. Finally, we discuss the results of this experiment, where ten subjects were involved.

Full Paper

Bibliographic reference.  Vacher, Michel / Fleury, Anthony / Serignat, Jean-François / Noury, Norbert / Glasson, Hubert (2008): "Preliminary evaluation of speech/sound recognition for telemedicine application in a real environment", In INTERSPEECH-2008, 496-499.