11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Toward Detecting Voice Activity Employing Soft Decision in Second-Order Conditional MAP

Sang-Kyun Kim, Jae-Hun Choi, Sang-Ick Kang, Ji-Hyun Song, Joon-Hyuk Chang

Inha University, Korea

In this paper, we propose a novel approach to statistical model-based voice activity detection (VAD) that incorporates a second-order conditional maximum a posteriori (MAP) criterion. As a technical improvement for the first-order conditional MAP criterion in, we consider both the current observation and the voice activity decision in the previous two frames to take full consideration of the inter-frame correlation of voice activity. The soft decision scheme is incorporated to result in time-varying thresholds for further performance improvement. Experimental results show that the proposed algorithm outperforms the conventional CMAP-based VAD technique under various experimental conditions.

Full Paper

Bibliographic reference.  Kim, Sang-Kyun / Choi, Jae-Hun / Kang, Sang-Ick / Song, Ji-Hyun / Chang, Joon-Hyuk (2010): "Toward detecting voice activity employing soft decision in second-order conditional MAP", In INTERSPEECH-2010, 3082-3085.