EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Voicing Parameter and Energy Based Speech/Non-Speech Detection for Speech Recognition in Adverse Conditions

Arnaud Martin (1), Laurent Mauuary (2)

(1) Universite de Bretagne Sud, France
(2) France Telecom R&D, France

In adverse conditions, the speech recognition performance decreases in part due to imperfect speech/non-speech detection. In this paper, a new combination of voicing parameter and energy for speech/non-speech detection is described. This combination avoids especially the noise detections in real life very noisy environments and provides better performance for continuous speech recognition. This new speech/non-speech detection approach outperforms both noise statistical based [1] and Linear Discriminate Analysis (LDA) based [2] criteria in noisy environments and for continuous speech recognition applications.

Full Paper

Bibliographic reference.  Martin, Arnaud / Mauuary, Laurent (2003): "Voicing parameter and energy based speech/non-speech detection for speech recognition in adverse conditions", In EUROSPEECH-2003, 3069-3072.