Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

All-Pole Model Estimation of Vocal Tract on the Frequency Domain

Luis Weruaga (1), Amar Al-Khayat (2)

(1) Austrian Academy of Sciences, Austria; (2) Philips Austria GmbH, Austria

Probably the most powerful method for speech analysis is the linear prediction analysis, or LPC analysis, one of its main characteristics being the estimation of time-domain related parameters from time-domain samples. This paper proposes a novel speech analysis framework for estimating the spectral poles directly from spectral samples in voiced speech utterances. The method can be described in plain words as the task of fitting the spectral envelope of an all-pole model directly on the log energy of the harmonics. This problem is addressed with an analysis-by-synthesis mechanism supported on a Newton-Raphson algorithm of fast convergence. The proposed method differs clearly from previous approaches commonly used in Harmonic or Sinusoidal Coding. Comparative results on synthetic signals show the excellent performance of the novel analysis technique.

Full Paper

Bibliographic reference.  Weruaga, Luis / Al-Khayat, Amar (2006): "All-pole model estimation of vocal tract on the frequency domain", In INTERSPEECH-2006, paper 1188-Tue2A1O.2.