Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm

Qingju Liu, Yan Tang, Philip J.B. Jackson, Wenwu Wang


State-of-the-art binaural objective intelligibility measures (OIMs) require the individual source signals to make intelligibility predictions, which limits their usability in real-time online operation. This limitation may be addressed by a blind source separation (BSS) process, which can extract the underlying sources from a mixture. In this study, a speech source is presented with either a stationary noise masker or a fluctuating noise masker whose azimuth varies in a horizontal plane, at two speech-to-noise ratios (SNRs). Three binaural OIMs are used to predict speech intelligibility from the signals separated by a BSS algorithm. The model predictions are compared with listeners’ word identification rates in a perceptual listening experiment. The results suggest that, when SNR compensation is applied to the BSS-separated speech signal, the OIMs can maintain their predictive power for individual maskers relative to their performance measured from the direct signals. The results also reveal that errors in SNR between the estimated signals are not the only factor that decreases the predictive accuracy of the OIMs with the separated signals; artefacts or distortions introduced into the estimated signals by the BSS algorithm may also contribute.


DOI: 10.21437/Interspeech.2016-410

Cite as

Liu, Q., Tang, Y., Jackson, P.J., Wang, W. (2016) Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm. Proc. Interspeech 2016, 140-144.

Bibtex
@inproceedings{Liu+2016,
author={Qingju Liu and Yan Tang and Philip J.B. Jackson and Wenwu Wang},
title={Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-410},
url={http://dx.doi.org/10.21437/Interspeech.2016-410},
pages={140--144}
}