ISCA Archive Eurospeech 1999

A novel approach of low bit-rate speech coding based on sinusoidal representation and auditory model

Wanggen Wan, Oscar C. Au, Cyan L. Keung, Chi H. Yim

This paper proposes a new auditory-spectrum-based speech feature built on a sinusoidal representation and an auditory model. The feature is optimized using the properties of auditory perception and masking. Quantizing and encoding the optimized feature parameters yields a new speech-coding algorithm with an average bit rate of 3.25 kbps. Experimental results show that the synthetic speech retains most of the intelligibility and articulation clarity of the original speech. Compared with conventional algorithms, the method needs no voiced/unvoiced decision or pitch estimation, has much lower complexity, and offers better robustness and adaptability. The algorithm can be implemented on a single DSP chip.
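To illustrate what a sinusoidal representation of a speech frame means in general, the following is a minimal sketch that picks spectral peaks from one frame and resynthesizes the frame as a sum of cosines. It is not the authors' algorithm: it includes no auditory model, masking-based optimization, quantization, or encoding, and the function names `dft_peaks` and `synthesize`, the frame length, and the sample rate are all assumptions made for this example.

```python
import cmath
import math


def dft_peaks(frame, sample_rate, max_peaks):
    """Return (amplitude, frequency_hz, phase) for the strongest DFT peaks.

    Hypothetical helper for illustration only; a real coder would window
    the frame and refine the peak estimates.
    """
    n = len(frame)
    half = n // 2
    # Naive DFT over the non-negative frequency bins.
    spectrum = [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(half + 1)
    ]
    mags = [abs(c) for c in spectrum]
    # Local maxima of the magnitude spectrum, excluding DC and Nyquist.
    peaks = [
        k for k in range(1, half)
        if mags[k] > mags[k - 1] and mags[k] >= mags[k + 1]
    ]
    # Keep only the strongest peaks, then restore ascending frequency order.
    peaks.sort(key=lambda k: mags[k], reverse=True)
    return [
        (2.0 * mags[k] / n, k * sample_rate / n, cmath.phase(spectrum[k]))
        for k in sorted(peaks[:max_peaks])
    ]


def synthesize(params, n, sample_rate):
    """Rebuild n samples as a sum of cosines from (amp, freq, phase) triples."""
    return [
        sum(a * math.cos(2 * math.pi * f * t / sample_rate + p)
            for a, f, p in params)
        for t in range(n)
    ]
```

In this sketch each frame is described only by its peak amplitudes, frequencies, and phases, so no voiced/unvoiced decision or pitch estimate appears anywhere in the analysis. That is a property of sinusoidal models in general, and it says nothing about how the paper itself parameterizes or encodes the frame.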


doi: 10.21437/Eurospeech.1999-350

Cite as: Wan, W., Au, O.C., Keung, C.L., Yim, C.H. (1999) A novel approach of low bit-rate speech coding based on sinusoidal representation and auditory model. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1555-1558, doi: 10.21437/Eurospeech.1999-350

@inproceedings{wan99_eurospeech,
  author={Wanggen Wan and Oscar C. Au and Cyan L. Keung and Chi H. Yim},
  title={{A novel approach of low bit-rate speech coding based on sinusoidal representation and auditory model}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={1555--1558},
  doi={10.21437/Eurospeech.1999-350}
}