Wavelet Speech Enhancement Based on Robust Principal Component Analysis

Chia-Lung Wu, Hsiang-Ping Hsu, Syu-Siang Wang, Jeih-Weih Hung, Ying-Hui Lai, Hsin-Min Wang, Yu Tsao


Most state-of-the-art speech enhancement (SE) techniques prefer to enhance utterances in the frequency domain rather than in the time domain. However, the overlap-add (OLA) operation in the short-time Fourier transform (STFT) for speech signal processing possibly distorts the signal and limits the performance of the SE techniques. In this study, a novel SE method that integrates the discrete wavelet packet transform (DWPT) and a novel subspace-based method, robust principal component analysis (RPCA), is proposed to enhance noise-corrupted signals directly in the time domain. We evaluate the proposed SE method on the Mandarin hearing in noise test (MHINT) sentences. The experimental results show that the new method reduces the signal distortions dramatically, thereby improving speech quality and intelligibility significantly. In addition, the newly proposed method outperforms the STFT-RPCA-based speech enhancement system.


 DOI: 10.21437/Interspeech.2017-781

Cite as: Wu, C., Hsu, H., Wang, S., Hung, J., Lai, Y., Wang, H., Tsao, Y. (2017) Wavelet Speech Enhancement Based on Robust Principal Component Analysis. Proc. Interspeech 2017, 439-443, DOI: 10.21437/Interspeech.2017-781.


@inproceedings{Wu2017,
  author={Chia-Lung Wu and Hsiang-Ping Hsu and Syu-Siang Wang and Jeih-Weih Hung and Ying-Hui Lai and Hsin-Min Wang and Yu Tsao},
  title={Wavelet Speech Enhancement Based on Robust Principal Component Analysis},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={439--443},
  doi={10.21437/Interspeech.2017-781},
  url={http://dx.doi.org/10.21437/Interspeech.2017-781}
}