Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Analysis-by-Synthesis Low-Rate Multimode Harmonic Speech Coding

Chunyan Li, Allen Gersho, Vladimir Cuperman

Department of Electrical and Computer Engineering, University of California, Santa Barabara, CA, USA

This paper presents an analysis-by-synthesis multimode harmonic coder(AbS-MHC) that employs new techniques to improve both the speech model accuracy and the parameter estimation robustness in the low rate harmonic coding framework. To improve the speech model accuracy, an enhanced frequency domain transition model is used in conjunction with the sinusoidal model based harmonic coding of voiced/unvoiced speech signals. To achieve robust parameter estimation, a generalized analysis-by-synthesis parameter estimation scheme in the harmonic coding framework is proposed. This scheme uses a time scale signal modification technique to allow for waveform matching in harmonic coding. This concept is demonstrated in our AbS-MHC coder with a specific method for efficient closed-loop pitch estimation and speech classification. The speech quality of the unquantized AbS-MHC coder is better than the 6.3 kbps G.723 quality.

Full Paper (PDF)

Acoustic Example #1 (F4K)
Acoustic Example #2 (FOR)
Acoustic Example #3 (M4K)
Acoustic Example #4 (MOR)

Bibliographic reference.  Li, Chunyan / Gersho, Allen / Cuperman, Vladimir (1999): "Analysis-by-synthesis low-rate multimode harmonic speech coding", In EUROSPEECH'99, 1451-1454.