ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Frequency-domain parameter estimations for binary masked signals

J. X. Zhang, Mads Græsbøll Christensen, Joachim Dahl, Søren Holdt Jensen, Marc Moonen

We present an approach for the extraction of parameters of a damped complex exponential model from a spectrogram modified by a binary mask. The parameters are estimated by a frequency domain based methods using subspace techniques, where the core algorithm is F-ESPRIT. The sub-band defined by the binary mask provides a reduced number of DFT-samples for the parameter extractions, which results in a computational efficient scheme with high parameter estimation accuracy. The proposed synthesis system has synthesis performance comparable to the so-called LSEE-MSTFT. The estimated parameters can be used in many applications such as audio/speech coding, pitch estimation and pitch scale modification.

doi: 10.21437/Interspeech.2008-395

Cite as: Zhang, J.X., Christensen, M.G., Dahl, J., Jensen, S.H., Moonen, M. (2008) Frequency-domain parameter estimations for binary masked signals. Proc. Interspeech 2008, 1357-1360, doi: 10.21437/Interspeech.2008-395

  author={J. X. Zhang and Mads Græsbøll Christensen and Joachim Dahl and Søren Holdt Jensen and Marc Moonen},
  title={{Frequency-domain parameter estimations for binary masked signals}},
  booktitle={Proc. Interspeech 2008},