Sixth European Conference on Speech Communication and Technology
(EUROSPEECH'99)

Budapest, Hungary
September 5-9, 1999

A Novel Approach of Low Bit-rate Speech Coding Based on Sinusoidal Representation and Auditory Model

Wanggen Wan, Oscar C. Au, Cyan L. Keung, Chi H. Yim

Department of Electrical and Electronic Engineering Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong

In this paper, a new auditory spectrum based speech feature is proposed using sinusoidal representation and auditory model. The feature is optimized using the properties of auditory perception and masking. After quantizing and encoding the optimized feature parameters, a new speech-coding algorithm with average bit-rate of 3.25kbps is developed. The experimental results show that the synthetic speech retains most of the intelligibility and clearness of articulation of the original speech. Compared with the conventional algorithms, no voiced/unvoiced decision and pitch estimation are needed, complexity of the algorithm is much reduced, robustness and adaptation are both raised. The algorithm makes it possible to be realized with single DSP chip.


Full Paper (PDF)

Bibliographic reference.  Wan, Wanggen / Au, Oscar C. / Keung, Cyan L. / Yim, Chi H. (1999): "A novel approach of low bit-rate speech coding based on sinusoidal representation and auditory model", In EUROSPEECH'99, 1555-1558.