8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

A Packetization and Variable Bitrate Interframe Compression Scheme for Vector Quantizer-Based Distributed Speech Recognition

Bengt J. Borgström, Abeer Alwan

University of California at Los Angeles, USA

We propose a novel packetization and variable bitrate compression scheme for DSR source coding, based on the Group of Pictures concept from video coding. The proposed algorithm simultaneously packetizes and further compresses source coded features using the high interframe correlation of speech, and is compatible with a variety of VQ-based DSR source coders. The algorithm approximates vector quantizers as Markov Chains, and empirically trains the corresponding probability parameters. Feature frames are then compressed as I-frames, P-frames, or B-frames, using Huffman tables. The proposed scheme can perform lossless compression, but is also robust to lossy compression through VQ pruning or frame puncturing. To illustrate its effectiveness, we applied the proposed algorithm to the ETSI DSR source coder. The algorithm provided compression rates of up to 31.60% with negligible recognition accuracy degradation, and rates of up to 71.15% with performance degradation under 1.0%.

Full Paper

Bibliographic reference.  Borgström, Bengt J. / Alwan, Abeer (2007): "A packetization and variable bitrate interframe compression scheme for vector quantizer-based distributed speech recognition", In INTERSPEECH-2007, 578-581.