ISCA Archive Interspeech 2021
ISCA Archive Interspeech 2021

Low-Delay Speech Enhancement Using Perceptually Motivated Target and Loss

Xu Zhang, Xinlei Ren, Xiguang Zheng, Lianwu Chen, Chen Zhang, Liang Guo, Bing Yu

Speech enhancement approaches based on deep neural network have outperformed the traditional signal processing methods. This paper presents a low-delay speech enhancement method that employs a new perceptually motivated training target and loss function. The proposed approach can achieve similar speech enhancement performance compared to the state-of-the-art approaches, but with significantly less latency and computational complexities. Judged by the MOS tests conducted by the INTERSPEECH 2021 Deep Noise Suppression Challenge organizer, the proposed method is ranked the 2nd place for Background Noise MOS, and the 6th place for overall MOS.


doi: 10.21437/Interspeech.2021-1410

Cite as: Zhang, X., Ren, X., Zheng, X., Chen, L., Zhang, C., Guo, L., Yu, B. (2021) Low-Delay Speech Enhancement Using Perceptually Motivated Target and Loss. Proc. Interspeech 2021, 2826-2830, doi: 10.21437/Interspeech.2021-1410

@inproceedings{zhang21t_interspeech,
  author={Xu Zhang and Xinlei Ren and Xiguang Zheng and Lianwu Chen and Chen Zhang and Liang Guo and Bing Yu},
  title={{Low-Delay Speech Enhancement Using Perceptually Motivated Target and Loss}},
  year=2021,
  booktitle={Proc. Interspeech 2021},
  pages={2826--2830},
  doi={10.21437/Interspeech.2021-1410}
}