INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Temporal Normalization Techniques for Transform-Type Speech Coding and Application to Split-band Wideband Coders

Kyung-Tae Kim (1), Sung-Kyo Jung (1), MiSuk Lee (2), Hong-Goo Kang (1), Dae Hee Youn (1)

(1) Yonsei University, Korea
(2) ETRI, Korea

In this paper we present an efficient coding method for the upper band (4-7 kHz) of wideband (0.5-7 kHz) speech coding based on a band-split approach. Because the upper-band signal has impulse-like characteristics, it is very difficult to quantize efficiently at low bit rates with transform coding techniques. We propose two temporal normalization techniques, direct temporal energy normalization and frequency-domain linear prediction, to reduce the highly noticeable artifacts that result. Simulation results show that the proposed algorithm successfully encodes the upper-band signal, and that the new split-band wideband coder adopting the proposed technology provides better quality at 20 kbit/s than ITU-T G.722 at 56 kbit/s.
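
The abstract does not give the details of either technique, so the following is only a minimal sketch of the general idea behind temporal energy normalization, not the authors' algorithm. It assumes a frame is split into equal subframes, each subframe is divided by its own RMS gain before transform coding, and the decoder multiplies the gains back. The function names, the subframe count, and the gain floor are all hypothetical choices made for illustration.

```python
import math


def subframe_gains(frame, num_subframes, floor=1e-9):
    """Return one RMS gain per subframe (hypothetical helper).

    Assumes len(frame) is a multiple of num_subframes. The floor keeps
    silent subframes from causing a division by zero.
    """
    if len(frame) % num_subframes != 0:
        raise ValueError("frame length must be a multiple of num_subframes")
    size = len(frame) // num_subframes
    gains = []
    for i in range(num_subframes):
        segment = frame[i * size:(i + 1) * size]
        rms = math.sqrt(sum(x * x for x in segment) / size)
        gains.append(max(rms, floor))
    return gains


def normalize(frame, gains):
    """Flatten the temporal envelope: divide each sample by its subframe gain."""
    size = len(frame) // len(gains)
    return [x / gains[n // size] for n, x in enumerate(frame)]


def denormalize(frame, gains):
    """Restore the temporal envelope: multiply each sample by its subframe gain."""
    size = len(frame) // len(gains)
    return [x * gains[n // size] for n, x in enumerate(frame)]


def crest_factor(frame):
    """Peak-to-RMS ratio, used here as a rough measure of 'impulsiveness'."""
    rms = math.sqrt(sum(x * x for x in frame) / len(frame))
    return max(abs(x) for x in frame) / rms


# An impulse-like toy frame: a short burst surrounded by low-level samples.
frame = [0.01] * 8 + [1.0, -0.8] + [0.01] * 6
gains = subframe_gains(frame, num_subframes=4)
flat = normalize(frame, gains)
restored = denormalize(flat, gains)
```

In this toy case the normalized frame has a lower peak-to-RMS ratio than the input, which is the property that would make it easier for a transform quantizer to handle. In a real coder the gains would also have to be quantized and transmitted as side information, which this sketch leaves out.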

Bibliographic reference.  Kim, Kyung-Tae / Jung, Sung-Kyo / Lee, MiSuk / Kang, Hong-Goo / Youn, Dae Hee (2004): "Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders", In INTERSPEECH-2004, 2661-2664.