Sixth International Conference on Spoken Language Processing
This paper describes a sinusoidal analysis and synthesis framework that uses a novel method for extracting the sinusoidal components and the fundamental frequency. The method is based on a mapping from linearly spaced filter centre frequencies to the instantaneous frequencies of the filter outputs. The fixed points of this mapping in the frequency domain yield the constituent sinusoidal components of the input signal. A robust fundamental frequency extraction technique based on a wavelet representation of this model is also used. These form the essential parts of the sinusoidal analysis framework, which also includes a sinusoidal component trajectory continuation scheme. In synthesis, the inverse FFT method is used to reconstruct the spectrum. The model has been shown to produce speech of high quality and is also applicable to other sound sources.
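The fixed-point idea can be illustrated with a minimal sketch. This is not the authors' implementation. The Gaussian band-pass filters, the 10 Hz centre-frequency grid, the relative amplitude threshold, and all names (`filter_output`, `fixed_points`, `signal`) are assumptions made for illustration only. For each centre frequency, the sketch estimates the instantaneous frequency of the filter output from the phase advance between two adjacent samples. It then looks for points where the mapping crosses the identity line from above, which is where the instantaneous frequency equals the centre frequency.

```python
import cmath
import math


def filter_output(x, n, fc, fs, sigma, half):
    """Complex output at sample n of a Gaussian band-pass filter centred at fc Hz.

    x is a callable returning the signal value at any integer sample index.
    """
    total = 0j
    for k in range(-half, half + 1):
        w = math.exp(-0.5 * (k / sigma) ** 2)  # Gaussian window (assumed shape)
        total += x(n + k) * w * cmath.exp(-2j * math.pi * fc * k / fs)
    return total


def fixed_points(x, fs, centres, sigma=80.0, rel_threshold=0.1):
    """Return frequencies where instantaneous frequency equals centre frequency."""
    half = int(4 * sigma)  # truncate the window at +/- 4 standard deviations
    inst, mag = [], []
    for fc in centres:
        y0 = filter_output(x, 0, fc, fs, sigma, half)
        y1 = filter_output(x, 1, fc, fs, sigma, half)
        # Phase advance over one sample, converted to Hz.
        inst.append(cmath.phase(y1 * y0.conjugate()) * fs / (2 * math.pi))
        mag.append(abs(y0))

    floor = rel_threshold * max(mag)  # assumed guard against weak channels
    found = []
    for i in range(len(centres) - 1):
        g0 = inst[i] - centres[i]
        g1 = inst[i + 1] - centres[i + 1]
        # Keep only downward crossings of the identity line in strong channels.
        if g0 >= 0 > g1 and min(mag[i], mag[i + 1]) > floor:
            step = centres[i + 1] - centres[i]
            found.append(centres[i] + g0 * step / (g0 - g1))  # linear interpolation
    return found


fs = 8000.0


def signal(n):
    """Synthetic test input: two sinusoids at 203 Hz and 447 Hz."""
    return (math.cos(2 * math.pi * 203.0 * n / fs)
            + 0.5 * math.cos(2 * math.pi * 447.0 * n / fs))


centres = [50.0 + 10.0 * i for i in range(56)]  # linearly spaced, 50 Hz to 600 Hz
estimates = fixed_points(signal, fs, centres)
print(estimates)
```

The amplitude threshold is a simplification: channels far from any component carry mostly window leakage, so their instantaneous frequencies are unreliable and could produce spurious crossings. A practical system would need a more principled reliability measure.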
Bibliographic reference. Zolfaghari, Parham / Kawahara, Hideki (2000): "A sinusoidal model based on frequency-to-instantaneous frequency mapping", in ICSLP-2000, vol. 4, 692-695.