ISCA Archive Interspeech 2008

Silence feature normalization for robust speech recognition in additive noise environments

Chieh-cheng Wang, Chi-an Pan, Jeih-weih Hung

In this paper, we propose a simple yet very effective feature compensation scheme that improves the noise robustness of two energy-related features: the logarithmic energy (logE) and the zeroth cepstral coefficient (c0). The scheme, named silence feature normalization (SFN), uses high-pass filtered features as the indicator for speech/non-speech classification; the features of non-speech frames are then set to small values, while those of speech frames are kept almost unchanged. In experiments conducted on the Aurora-2 database, SFN achieves a relative error reduction of nearly 50% over the baseline processing.
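
The idea described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the filter, the decision threshold, or the value assigned to non-speech frames, so everything below is an assumption made for illustration: a first-order high-pass filter with a hypothetical coefficient `alpha`, a threshold equal to the mean of the filtered sequence, and a hypothetical constant `floor` for non-speech frames. It should not be read as the authors' exact method.

```python
def high_pass(x, alpha=0.5):
    """Assumed first-order high-pass filter: y[n] = x[n] - alpha * x[n-1]."""
    y = []
    prev = x[0] if x else 0.0  # reuse the first sample to avoid a start-up spike
    for value in x:
        y.append(value - alpha * prev)
        prev = value
    return y


def silence_feature_normalization(x, alpha=0.5, floor=0.0):
    """Sketch of SFN on one utterance's logE (or c0) sequence.

    x     : list of per-frame feature values
    alpha : hypothetical filter coefficient (not given in the abstract)
    floor : hypothetical small value assigned to non-speech frames
    """
    if not x:
        return []
    filtered = high_pass(x, alpha)
    # Assumed decision rule: a frame is speech if its filtered value
    # exceeds the utterance-level mean of the filtered sequence.
    threshold = sum(filtered) / len(filtered)
    return [
        value if f > threshold else floor  # speech frames are left unchanged
        for value, f in zip(x, filtered)
    ]


# Toy logE contour: two low-energy frames, three high-energy frames, two low-energy frames.
log_energy = [2.0, 2.0, 10.0, 10.0, 10.0, 2.0, 2.0]
print(silence_feature_normalization(log_energy, alpha=0.5, floor=0.0))
```

In this toy contour the three high-energy frames keep their original values and the remaining frames are set to `floor`. A practical system would apply the same procedure per utterance to the logE and c0 streams before recognition.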


doi: 10.21437/Interspeech.2008-298

Cite as: Wang, C.-c., Pan, C.-a., Hung, J.-w. (2008) Silence feature normalization for robust speech recognition in additive noise environments. Proc. Interspeech 2008, 1028-1031, doi: 10.21437/Interspeech.2008-298

@inproceedings{wang08d_interspeech,
  author={Chieh-cheng Wang and Chi-an Pan and Jeih-weih Hung},
  title={{Silence feature normalization for robust speech recognition in additive noise environments}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1028--1031},
  doi={10.21437/Interspeech.2008-298}
}