ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

An instrumental measure for end-to-end speech transmission quality based on perceptual dimensions: framework and realization

Marcel Wältermann, Kirstin Scholz, Sebastian Möller, Lu Huo, Alexander Raake, Ulrich Heute

In this contribution, a new instrumental measure for end-to-end speech transmission quality is presented which is based on perceptually relevant dimensions. The paper describes the complete scientific development process of such a measure, starting off from the general framework and concluding with the concrete realization. The measure is based on the dimensions "discontinuity", "noisiness", and "coloration", which were identified through multidimensional analyses. Three dimension estimators are introduced which are capable to predict so-called dimension impairment factors on the basis of signal parameters. Each dimension impairment factor reflects the degradation with respect to a single perceptual dimension. By combining the impairment factors, integral quality can be estimated. A maximum correlation of r = 0.9 with auditory test results is achieved for a wide range of perceptually different conditions.


doi: 10.21437/Interspeech.2008-13

Cite as: Wältermann, M., Scholz, K., Möller, S., Huo, L., Raake, A., Heute, U. (2008) An instrumental measure for end-to-end speech transmission quality based on perceptual dimensions: framework and realization. Proc. Interspeech 2008, 61-64, doi: 10.21437/Interspeech.2008-13

@inproceedings{waltermann08_interspeech,
  author={Marcel Wältermann and Kirstin Scholz and Sebastian Möller and Lu Huo and Alexander Raake and Ulrich Heute},
  title={{An instrumental measure for end-to-end speech transmission quality based on perceptual dimensions: framework and realization}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={61--64},
  doi={10.21437/Interspeech.2008-13}
}