EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

Optimization of the CELP Model in the LSP Domain

Khosrow Lashkari, Toshio Miki

DoCoMo USA Labs, USA

This paper presents a new Analysis-by-Synthesis (AbS) technique for joint optimization of the excitation and model parameters based on minimizing the closed loop synthesis error instead of the linear prediction error. By minimizing the synthesis error, the analysis and synthesis stages become more compatible. Using a gradient descent algorithm, LSPs for a given excitation are optimized to minimize the error between the original and the synthesized speech. Since the optimization starts from the LPC solution, the synthesis error is guaranteed to be lower than that obtained using the LPC coefficients. For the ITU G.729 codec, there is about 1dB of improvement in the segmental SNR for male and female speakers over 4 to 6 second long sentences. By adding an extra optimization step, the technique can be incorporated into the LPC, multi-pulse LPC and CELP-type speech coders.

Full Paper

Bibliographic reference.  Lashkari, Khosrow / Miki, Toshio (2003): "Optimization of the CELP model in the LSP domain", In EUROSPEECH-2003, 1709-1712.