ISCA Archive Interspeech 2017
ISCA Archive Interspeech 2017

Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio

Katsuhiko Yamamoto, Toshio Irino, Toshie Matsui, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani

A new intelligibility prediction measure, called “Gammachirp Envelope Distortion Index (GEDI)” is proposed for the evaluation of speech enhancement algorithms. This model calculates the signal-to-distortion ratio (SDR) in envelope responses SDRenv derived from the gammachirp filterbank outputs of clean and enhanced speech, and is an extension of the speech based envelope power spectrum model (sEPSM) to improve prediction and usability. An evaluation was performed by comparing human subjective results and model predictions for the speech intelligibility of noise-reduced sounds processed by spectral subtraction and a recent Wiener filtering technique. The proposed GEDI predicted the subjective results of the Wiener filtering better than those predicted by the original sEPSM and well-known conventional measures, i.e., STOI, CSII, and HASPI.


doi: 10.21437/Interspeech.2017-170

Cite as: Yamamoto, K., Irino, T., Matsui, T., Araki, S., Kinoshita, K., Nakatani, T. (2017) Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio. Proc. Interspeech 2017, 2949-2953, doi: 10.21437/Interspeech.2017-170

@inproceedings{yamamoto17_interspeech,
  author={Katsuhiko Yamamoto and Toshio Irino and Toshie Matsui and Shoko Araki and Keisuke Kinoshita and Tomohiro Nakatani},
  title={{Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio}},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={2949--2953},
  doi={10.21437/Interspeech.2017-170}
}