ITRW on Non-Linear Speech Processing (NOLISP 05)

Barcelona, Spain
April 19-22, 2005

Multiresolution Sinusoidal Speech Model Using Elliptic Band Pass Filter

Kihong Kim (1), Jinkeun Hong (2), Jongin Lim (3)

(1) National Security Research Institute, Yuseong, Daejeon, Korea
(2) Division of Information & Communication, Cheonan University, Cheonan-si, Chungnam, Korea
(3) Graduate School of Information Security, Korea University, Seoul, Korea

The sinusoidal speech model represents a speech signal as a linear combination of sinusoids with time-varying parameters (amplitudes, frequencies, and phases). One drawback of this model is that the width of the analysis window is generally fixed. Because the sinusoidal components lie at different frequencies, a fixed-width analysis window cannot guarantee optimum spectral resolution for every component. In this paper, we propose and implement a multiresolution sinusoidal speech model that uses an elliptic band pass filter to overcome this drawback and to estimate the sinusoidal parameters more precisely. Experimental results show that the proposed model performs better than the classical sinusoidal model.
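
To make the abstract's two ideas concrete, the following is a minimal sketch, not the authors' implementation. It assumes parameters that are constant within one frame, a sampling rate of 8 kHz, and a hypothetical three-band layout with illustrative window lengths; none of these values come from the paper. It shows (a) a frame synthesized as a sum of sinusoids from (amplitude, frequency, phase) triples and (b) how DFT bin spacing depends on window length, which is why a single fixed window gives every component the same resolution. The elliptic band pass filtering stage itself is not reproduced here.

```python
import math


def synthesize(params, num_samples, fs):
    """Sum of sinusoids for one frame: s[n] = sum_l A_l * cos(2*pi*f_l*n/fs + phi_l).

    params is a list of (amplitude, frequency_hz, phase_rad) triples,
    assumed constant over the frame (an assumption of this sketch).
    """
    return [
        sum(a * math.cos(2.0 * math.pi * f * n / fs + phi) for a, f, phi in params)
        for n in range(num_samples)
    ]


def bin_spacing_hz(window_len, fs):
    """DFT bin spacing in Hz for an analysis window of window_len samples."""
    return fs / window_len


fs = 8000  # assumed sampling rate in Hz

# Two illustrative components: (amplitude, frequency in Hz, phase in radians).
params = [(1.0, 200.0, 0.0), (0.5, 1500.0, math.pi / 2)]
frame = synthesize(params, 160, fs)

# Hypothetical band layout: longer windows for lower bands, shorter for higher.
window_len_by_band = {"low": 512, "mid": 256, "high": 128}
resolution = {band: bin_spacing_hz(n, fs) for band, n in window_len_by_band.items()}
```

In this sketch the longer low-band window yields finer frequency spacing, while the shorter high-band window trades frequency resolution for time resolution. A fixed window would apply one such trade-off to all bands.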


Bibliographic reference. Kim, Kihong / Hong, Jinkeun / Lim, Jongin (2005): "Multiresolution sinusoidal speech model using elliptic band pass filter", in NOLISP-2005, 70-75.