This paper addresses the issue of masking of noise in voiced speech. First, we examine the audibility of cyclostationary narrow-band noise added to voiced speech generated by synthetic excitation. Varying the temporal location of noise within a pitch cycle corresponds to varying its phase spectrum. Using this fact, we find that a phase change of the noise in the high frequency region is more perceptible for a low-pitched sound than for a high-pitched sound. We propose a pitch-dependent temporal weighting function and we show experimentally that it is beneficial to the quantization of pitch-cycle waveforms.
Cite as: Skoglund, J., Kleijn, W.B. (1998) On the significance of temporal masking in speech coding. Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998), paper 0747, doi: 10.21437/ICSLP.1998-381
@inproceedings{skoglund98_icslp, author={Jan Skoglund and W. Bastiaan Kleijn}, title={{On the significance of temporal masking in speech coding}}, year=1998, booktitle={Proc. 5th International Conference on Spoken Language Processing (ICSLP 1998)}, pages={paper 0747}, doi={10.21437/ICSLP.1998-381} }