ISCA Archive ICSLP 2000

A sinusoidal model based on frequency-to-instantaneous frequency mapping

Parham Zolfaghari, Hideki Kawahara

In this paper we describe a sinusoidal analysis and synthesis framework that uses a novel method of extracting the sinusoidal components and the fundamental frequency. The method is based on a mapping from linearly spaced filter centre frequencies to the instantaneous frequencies of the filter outputs. The frequency-domain fixed points of this mapping yield the constituent sinusoidal components of the input signal. A robust fundamental frequency extraction technique based on a wavelet representation of this model is also used. These form the essential parts of the sinusoidal analysis framework, which also includes a scheme for continuing sinusoidal component trajectories. In synthesis, the spectrum is reconstructed using the inverse FFT method [1]. The model has been shown to produce speech of high quality and is also applicable to other sound sources.
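The fixed-point idea can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes that the bins of a Hann-windowed FFT stand in for the filter bank, that the instantaneous frequency of each bin is estimated from the phase advance between two frames one sample apart, and that low-level bins are discarded with a relative magnitude threshold. The function name `fixed_point_frequencies` and the parameters `n_fft` and `rel_threshold_db` are hypothetical.

```python
import numpy as np


def fixed_point_frequencies(x, fs, n_fft=1024, rel_threshold_db=-40.0):
    """Estimate sinusoid frequencies as fixed points of the mapping
    from filter centre frequency to instantaneous frequency.

    Assumptions (not from the paper): FFT bins act as the filters,
    and a magnitude threshold removes spurious low-level fixed points.
    """
    x = np.asarray(x, dtype=float)
    if len(x) < n_fft + 1:
        raise ValueError("need at least n_fft + 1 samples")

    window = np.hanning(n_fft)
    # Two analysis frames offset by exactly one sample.
    spec0 = np.fft.rfft(window * x[:n_fft])
    spec1 = np.fft.rfft(window * x[1:n_fft + 1])

    # Linearly spaced filter centre frequencies in Hz.
    centre = np.fft.rfftfreq(n_fft, d=1.0 / fs)
    # Phase advance per sample, converted to Hz: the instantaneous
    # frequency of each filter output.
    inst = np.angle(spec1 * np.conj(spec0)) * fs / (2.0 * np.pi)

    # A fixed point satisfies inst(f) == f. Keep only crossings where
    # inst - centre goes from positive to non-positive.
    diff = inst - centre
    mag = np.abs(spec0)
    floor = mag.max() * 10.0 ** (rel_threshold_db / 20.0)

    found = []
    for k in range(len(centre) - 1):
        strong = max(mag[k], mag[k + 1]) >= floor
        if diff[k] > 0.0 >= diff[k + 1] and strong:
            # Linear interpolation of the zero crossing between bins.
            t = diff[k] / (diff[k] - diff[k + 1])
            found.append(centre[k] + t * (centre[k + 1] - centre[k]))
    return np.array(found)


if __name__ == "__main__":
    fs = 8000.0
    t = np.arange(2048) / fs
    signal = np.cos(2 * np.pi * 440.0 * t) + 0.5 * np.cos(2 * np.pi * 1230.0 * t + 0.3)
    print(fixed_point_frequencies(signal, fs))
```

Near a stationary sinusoid, every filter that the component dominates reports roughly the same instantaneous frequency, so the mapping is locally flat and crosses the identity line at the component frequency. The sketch keeps only downward crossings because the upward crossings lie between components, where no single sinusoid dominates.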

[1] Depalle, P., and Rodet, X. Synthèse additive par FFT inverse. Rapport Interne IRCAM (1990).


Cite as: Zolfaghari, P., Kawahara, H. (2000) A sinusoidal model based on frequency-to-instantaneous frequency mapping. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 692-695

@inproceedings{zolfaghari00b_icslp,
  author={Parham Zolfaghari and Hideki Kawahara},
  title={{A sinusoidal model based on frequency-to-instantaneous frequency mapping}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 4, 692-695}
}