EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Perceptual Irrelevancy Removal in Narrowband Speech Coding

Marja Lahdekorpi (1), Jani Nurminen (2), Ari Heikkinen (2), Jukka Saarinen (2)

(1) Tampere University of Technology, Finland
(2) Nokia Research Center, Finland

A masking model originally designed for audio signals is applied to narrowband speech. The model is used to detect and remove the perceptually irrelevant simultaneously masked frequency components of a speech signal. Objective measurements have shown that the modified speech signal can be coded more efficiently than the original signal. Furthermore, it has been confirmed through perceptual evaluation that the removal of these frequency components does not cause significant degradation of the speech quality but rather, it has consistently improved the output quality of two standardized speech codecs. Thus, the proposed irrelevancy removal technique can be used at the front end of a speech coder to achieve enhanced coding efficiency.

Full Paper

Bibliographic reference.  Lahdekorpi, Marja / Nurminen, Jani / Heikkinen, Ari / Saarinen, Jukka (2003): "Perceptual irrelevancy removal in narrowband speech coding", In EUROSPEECH-2003, 1081-1084.