Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech

Katsuhiko Yamamoto, Toshio Irino, Narumi Ohashi, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani


A multi-resolution version of the gammachirp envelope distortion index (mr-GEDI) is proposed for the intelligibility prediction of noisy speech processed using speech enhancement algorithms. The proposed model calculates the short-time signal-to-distortion ratio in the temporal envelope modulation extracted from the output of the gammachirp auditory filterbank. The predictions were compared with human subjective results for various signal-to-noise ratio conditions with pink and babble noise. The mr-GEDI predicts the intelligibility curves better than the hearing-aid speech perception index (HASPI).


 DOI: 10.21437/Interspeech.2018-1291

Cite as: Yamamoto, K., Irino, T., Ohashi, N., Araki, S., Kinoshita, K., Nakatani, T. (2018) Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech. Proc. Interspeech 2018, 1863-1867, DOI: 10.21437/Interspeech.2018-1291.


@inproceedings{Yamamoto2018,
  author={Katsuhiko Yamamoto and Toshio Irino and Narumi Ohashi and Shoko Araki and Keisuke Kinoshita and Tomohiro Nakatani},
  title={Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={1863--1867},
  doi={10.21437/Interspeech.2018-1291},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1291}
}