9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

Frequency-Domain Parameter Estimations for Binary Masked Signals

J. X. Zhang (1), Mads Græsbøll Christensen (1), Joachim Dahl (1), Søren Holdt Jensen (1), Marc Moonen (2)

(1) Aalborg University, Denmark; (2) Katholieke Universiteit Leuven, Belgium

We present an approach for the extraction of parameters of a damped complex exponential model from a spectrogram modified by a binary mask. The parameters are estimated by a frequency domain based methods using subspace techniques, where the core algorithm is F-ESPRIT. The sub-band defined by the binary mask provides a reduced number of DFT-samples for the parameter extractions, which results in a computational efficient scheme with high parameter estimation accuracy. The proposed synthesis system has synthesis performance comparable to the so-called LSEE-MSTFT. The estimated parameters can be used in many applications such as audio/speech coding, pitch estimation and pitch scale modification.

Full Paper

Bibliographic reference.  Zhang, J. X. / Christensen, Mads Græsbøll / Dahl, Joachim / Jensen, Søren Holdt / Moonen, Marc (2008): "Frequency-domain parameter estimations for binary masked signals", In INTERSPEECH-2008, 1357-1360.