ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Assessment of objective quality measures for speech intelligibility

W. M. Liu, K. A. Jellyman, N. W. D. Evans, John S. D. Mason

This paper assesses 9 prominent objective quality measures for their potential in intelligibility estimation. Degradation considered include additive noises and those introduced by coding and enhancement schemes, totalling 78 types. This paper is believed to be the first to conduct an assessment on such a large combination of quality measures and degradations allowing side-by-side analysis. Experimental results show that the sophisticated perceptual-based measures which are superior for quality estimation, do not necessarily correlate well with human intelligibility and, in fact, give poorer correlations when enhancement schemes are considered. Meanwhile, the weighted spectral slope (WSS) emerges to be the most promising approach among all measures considered, scoring the highest correlation in 5 out of the 6 test sets. Worth noting are the positive correlations obtained with WSS which range from 0.14 to 0.86, as opposed to those with PESQ from -0.58 to 0.74. Such findings put WSS, a relatively conventional measure, in a new light as a potential intelligibility assessor.

doi: 10.21437/Interspeech.2008-220

Cite as: Liu, W.M., Jellyman, K.A., Evans, N.W.D., Mason, J.S.D. (2008) Assessment of objective quality measures for speech intelligibility. Proc. Interspeech 2008, 699-702, doi: 10.21437/Interspeech.2008-220

  author={W. M. Liu and K. A. Jellyman and N. W. D. Evans and John S. D. Mason},
  title={{Assessment of objective quality measures for speech intelligibility}},
  booktitle={Proc. Interspeech 2008},