5th International Conference on Spoken Language Processing

Sydney, Australia
November 30 - December 4, 1998

On the Significance of Temporal Masking in Speech Coding

Jan Skoglund (1), W. Bastiaan Kleijn (2)

(1) Chalmers University of Technology, Department of Signals and Systems, Sweden
(2) Royal Institute of Technology, Department of Speech, Music and Hearing, Sweden

This paper addresses the masking of noise in voiced speech. First, we examine the audibility of cyclostationary narrow-band noise added to voiced speech generated by synthetic excitation. Varying the temporal location of the noise within a pitch cycle corresponds to varying its phase spectrum. Using this fact, we find that a phase change of the noise in the high-frequency region is more perceptible for a low-pitched sound than for a high-pitched sound. We propose a pitch-dependent temporal weighting function, and we show experimentally that it is beneficial to the quantization of pitch-cycle waveforms.
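The link between temporal location and phase spectrum can be illustrated with a minimal NumPy sketch. It is not the experimental procedure of the paper: the cycle length, burst length, window, and delay are hypothetical values chosen for illustration, and the noise burst is treated as periodic over one pitch cycle, so that moving it is a circular shift.

```python
import numpy as np

# Hypothetical parameters, not taken from the paper:
# an 80-sample pitch cycle (e.g. 100 Hz pitch at 8 kHz sampling).
cycle_len = 80
burst_len = 20
delay = 30  # shift of the noise burst within the cycle, in samples

rng = np.random.default_rng(0)

# One pitch cycle containing a short windowed noise burst at its start.
burst = np.zeros(cycle_len)
burst[:burst_len] = rng.standard_normal(burst_len) * np.hanning(burst_len)

# Move the burst later in the cycle (circular shift, since the noise
# is assumed to repeat with the pitch period).
shifted = np.roll(burst, delay)

spec = np.fft.fft(burst)
spec_shifted = np.fft.fft(shifted)

# DFT shift theorem: a delay of d samples multiplies bin k
# by exp(-j * 2 * pi * k * d / N).
k = np.arange(cycle_len)
predicted = spec * np.exp(-2j * np.pi * k * delay / cycle_len)

# The magnitude spectrum is unchanged; only the phase spectrum differs.
magnitude_unchanged = np.allclose(np.abs(spec), np.abs(spec_shifted))
phase_is_linear_shift = np.allclose(spec_shifted, predicted)

print(magnitude_unchanged, phase_is_linear_shift)
```

Under these assumptions the two versions of the noise have identical magnitude spectra, so any audible difference between them must be attributed to the phase spectrum, that is, to where the noise falls within the pitch cycle.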

