INTERSPEECH 2008
9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

An Instrumental Measure for End-to-End Speech Transmission Quality Based on Perceptual Dimensions: Framework and Realization

Marcel Wältermann (1), Kirstin Scholz (2), Sebastian Möller (1), Lu Huo (2), Alexander Raake (1), Ulrich Heute (2)

(1) Technische Universität Berlin, Germany; (2) Christian-Albrechts-Universität zu Kiel, Germany

In this contribution, a new instrumental measure for end-to-end speech transmission quality is presented which is based on perceptually relevant dimensions. The paper describes the complete scientific development process of such a measure, starting off from the general framework and concluding with the concrete realization. The measure is based on the dimensions "discontinuity", "noisiness", and "coloration", which were identified through multidimensional analyses. Three dimension estimators are introduced which are capable to predict so-called dimension impairment factors on the basis of signal parameters. Each dimension impairment factor reflects the degradation with respect to a single perceptual dimension. By combining the impairment factors, integral quality can be estimated. A maximum correlation of r = 0.9 with auditory test results is achieved for a wide range of perceptually different conditions.

Full Paper

Bibliographic reference.  Wältermann, Marcel / Scholz, Kirstin / Möller, Sebastian / Huo, Lu / Raake, Alexander / Heute, Ulrich (2008): "An instrumental measure for end-to-end speech transmission quality based on perceptual dimensions: framework and realization", In INTERSPEECH-2008, 61-64.