ISCA Archive Interspeech 2008

Intentional voice command detection for completely hands-free speech interface in home environments

Yasunari Obuchi, Masahito Togami, Takashi Sumiyoshi

We introduce a new class of speech processing called Intentional Voice Command Detection (IVCD). To achieve a completely hands-free speech interface, the system must reject not only noise but also unintended voices. The conventional voice activity detection (VAD) framework is not sufficient for this purpose, so we discuss how IVCD should be defined and how it can be realized. We investigate the implementation of IVCD from the viewpoints of feature extraction and classification, and show that combining various features with a support vector machine (SVM) achieves an IVCD accuracy of 93.2% on a large-scale audio database from real home environments.


doi: 10.21437/Interspeech.2008-27

Cite as: Obuchi, Y., Togami, M., Sumiyoshi, T. (2008) Intentional voice command detection for completely hands-free speech interface in home environments. Proc. Interspeech 2008, 119-122, doi: 10.21437/Interspeech.2008-27

@inproceedings{obuchi08_interspeech,
  author={Yasunari Obuchi and Masahito Togami and Takashi Sumiyoshi},
  title={{Intentional voice command detection for completely hands-free speech interface in home environments}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={119--122},
  doi={10.21437/Interspeech.2008-27}
}