ISCA Archive NOLISP 2007

An efficient VAD based on a generalized Gaussian PDF

O. Pernía, J. M. Gúrriz, J. Ramírez, C. G. Puntonet, I. Turias

Emerging applications of wireless speech communication demand increasing levels of performance in adverse noise environments, together with speech processing systems that offer a high response rate. This is a serious obstacle to meeting the demands of modern applications, and therefore these systems often need a noise reduction algorithm working in combination with a precise voice activity detector (VAD). This paper presents a new VAD for improving speech detection robustness in noisy environments and the performance of speech recognition systems. The algorithm defines an optimum likelihood ratio test (LRT) involving multiple and correlated observations (MCO). An analysis of the methodology for N = {2, 3} shows the robustness of the proposed approach by means of a clear reduction of the classification error as the number of observations is increased. The algorithm is also compared to different VAD methods, including the G.729, AMR and AFE standards as well as recently reported algorithms, showing a sustained advantage in speech/non-speech detection accuracy and speech recognition performance.
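
As a rough illustration of the idea named in the abstract, a likelihood ratio test over several observations under a generalized Gaussian model might be sketched as follows. This is not the authors' algorithm: the sketch assumes the observations are independent (the paper treats them as correlated), and the function names, the (alpha, beta) parameter values and the zero threshold are all hypothetical choices made for the example.

```python
import math

def gg_logpdf(x, alpha, beta, mu=0.0):
    """Log-density of a generalized Gaussian with scale alpha and shape beta.

    f(x) = beta / (2 * alpha * Gamma(1 / beta)) * exp(-(|x - mu| / alpha) ** beta)
    """
    log_norm = math.log(beta) - math.log(2.0 * alpha) - math.lgamma(1.0 / beta)
    return log_norm - (abs(x - mu) / alpha) ** beta

def log_likelihood_ratio(observations, noise, speech):
    """Sum of per-observation log-ratios log f(x | speech) - log f(x | noise).

    Assumption: observations are treated as independent, so the joint
    log-likelihood is a plain sum. The paper's test uses correlated ones.
    """
    return sum(gg_logpdf(x, *speech) - gg_logpdf(x, *noise) for x in observations)

def is_speech(observations, noise, speech, threshold=0.0):
    """Decide 'speech' when the log-likelihood ratio exceeds the threshold."""
    return log_likelihood_ratio(observations, noise, speech) > threshold

# Hypothetical (alpha, beta) pairs: a narrow Gaussian-shaped noise model
# and a wider Laplacian-shaped speech model.
NOISE = (1.0, 2.0)
SPEECH = (4.0, 1.0)

loud_frame = [5.0, -6.0, 4.5]    # N = 3 large-amplitude observations
quiet_frame = [0.1, -0.2, 0.05]  # N = 3 small-amplitude observations
loud_decision = is_speech(loud_frame, NOISE, SPEECH)
quiet_decision = is_speech(quiet_frame, NOISE, SPEECH)
```

In this simplified form, each additional observation adds another term to the summed log-ratio, which gives some intuition for why a test over several observations can separate the two hypotheses more clearly than a single-observation test.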

Cite as: Pernía, O., Gúrriz, J.M., Ramírez, J., Puntonet, C.G., Turias, I. (2007) An efficient VAD based on a generalized Gaussian PDF. Proc. ITRW on Nonlinear Speech Processing (NOLISP 2007), 120-123

  author={O. Pernía and J. M. Gúrriz and J. Ramírez and C. G. Puntonet and I. Turias},
  title={{An efficient VAD based on a generalized Gaussian PDF}},
  booktitle={Proc. ITRW on Nonlinear Speech Processing (NOLISP 2007)},