ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

An assessment of automatic speech recognition as speech intelligibility estimation in the context of additive noise

Wei M. Liu, John S. D. Mason, Nicholas W. D. Evans, Keith A. Jellyman

This paper investigates the potential applicability of automatic speech recognition (ASR) and 6 well-reported objective quality measures for the task of ranking intelligibility of speech degraded by different real life background noises. In a recent investigation ASR has been reported to give high subjective correlation with human assessment when tested with various system degradations. This paper extends this investigation in two directions. First, the usefulness of the measures in the context of different real-life noises is considered. Second, the direct correspondence between statistics computed by an ASR system and human perceived intelligibility is assessed. Subjective listening tests are carried out to provide ground truth. Results show that ASR and WSS (weighted spectral slope) are the only two measures out of the seven considered to give good correlation with human opinion. Specially noted is performance of ASR with correlations ranging from 0.77 to 0.90.


doi: 10.21437/Interspeech.2006-563

Cite as: Liu, W.M., Mason, J.S.D., Evans, N.W.D., Jellyman, K.A. (2006) An assessment of automatic speech recognition as speech intelligibility estimation in the context of additive noise. Proc. Interspeech 2006, paper 1191-Wed3FoP.10, doi: 10.21437/Interspeech.2006-563

@inproceedings{liu06f_interspeech,
  author={Wei M. Liu and John S. D. Mason and Nicholas W. D. Evans and Keith A. Jellyman},
  title={{An assessment of automatic speech recognition as speech intelligibility estimation in the context of additive noise}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1191-Wed3FoP.10},
  doi={10.21437/Interspeech.2006-563}
}