11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Robust Statistical Voice Activity Detection Using a Likelihood Ratio Sign Test

Shiwen Deng, Jiqing Han

Harbin Institute of Technology, China

Voice activity detection (VAD) plays an important role on the performance of speech processing systems. Recently, more and more works focused on the statistical model-based voice activity detection (VAD) algorithms have been presented in literature, which make a decision of speech and nonspeech based on the likelihood ratio (LR). However, all the statistical models used in those algorithms are unable to exactly describe the statistics of noisy speech and various type noises. In this paper, a novel VAD algorithm is proposed based on the nonparametric detection theory by incorporating the likelihood ratio into the sign test to provide a new decision rule. Meanwhile, an optimal threshold of the proposed method is derived and the selections of relevant parameters are discussed as well. Experimental results show that the proposed VAD algorithm outperforms the conventional statistical model-based VAD.

Full Paper

Bibliographic reference.  Deng, Shiwen / Han, Jiqing (2010): "Robust statistical voice activity detection using a likelihood ratio sign test", In INTERSPEECH-2010, 3126-3129.