Noise Aware and Combined Noise Models for Speech Denoising in Unknown Noise Conditions

Pavlos Papadopoulos, Colin Vaz, Shrikanth S. Narayanan


Traditional denoising schemes require prior knowledge or statistics of the noise corrupting the signal, or estimate the noise from noise-only portions of the signal, which requires knowledge of speech boundaries. Extending denoising methods to perform well in unknown noise conditions can facilitate processing of data captured in different real life environments, and relax rigid data acquisition protocols. In this paper we propose two methods for denoising speech signals in unknown noise conditions. The first method has two stages. In the first stage we use Long Term Signal Variability features to decide which noise model to use from a pool of available models. Once we determine the noise type, we use Nonnegative Matrix Factorization with a dictionary trained on that noise to denoise the signal. In the second method, we create a combined noise dictionary from different types of noise, and use that dictionary in the denoising phase. Both of our systems improve signal quality, as measured by PESQ scores, for all the noise types we tested, and for different Signal to Noise Ratio levels.


DOI: 10.21437/Interspeech.2016-501

Cite as

Papadopoulos, P., Vaz, C., Narayanan, S.S. (2016) Noise Aware and Combined Noise Models for Speech Denoising in Unknown Noise Conditions. Proc. Interspeech 2016, 2866-2869.

Bibtex
@inproceedings{Papadopoulos+2016,
author={Pavlos Papadopoulos and Colin Vaz and Shrikanth S. Narayanan},
title={Noise Aware and Combined Noise Models for Speech Denoising in Unknown Noise Conditions},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-501},
url={http://dx.doi.org/10.21437/Interspeech.2016-501},
pages={2866--2869}
}