5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Wideband-Speech APVQ Coding from 16 to 32 kbps

Josep M. Salavedra

Department of Signal Theory and Communications, Universitat Politecnica de Catalunya, Campus Nord UPC, BARCELONA, SPAIN

This paper describes a coding scheme for broadband speech (sampling frequency 16KHz). We present a wideband speech encoder called APVQ (Adaptive Predictive Vector Quantization). It combines Subband Coding, Vector Quantization and Adaptive Prediction as it is represented in Fig.1. Speech signal is split in 16 subbands by means of a QMF filter bank and so every subband is 500Hz wide. This APVQ encoder can be seen either as a vectorial extension of a conventional ADPCM encoder or as a scalar Subband AVPC encoder [1],[3]. In this scheme, signal vector is formed with one sample of the normalized prediction error signal coming from different subbands and then it is vector quantized. Prediction error signal is normalized by its gain and normalized prediction error signal is the input of the VQ and therefore an adaptive Gain-Shape VQ is considered. This APVQ Encoder combines the advantages of Scalar Prediction and those of Vector Quantization. We evaluate wideband speech coding in the range from 1 to 2 bits/sample.

Full Paper

Bibliographic reference.  Salavedra, Josep M. (1997): "Wideband-speech APVQ coding from 16 to 32 kbps", In EUROSPEECH-1997, 1511-1514.