Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Pitch-Synchronous Time-Scaling for High-Frequency Excitation Regeneration

Joao P. Cabral, Luis C. Oliveira

INESC-ID/IST, Lisbon, Portugal

The goal of bandwidth extension of speech (BWE) is to extrapolate the missing low or high frequency components of the wide-band speech (50.8000 Hz) based entirely on information contained in a narrow-band signal (300.3400 Hz). In this paper we propose a new method for high-frequency regeneration of the excitation signal, using the correlation between the shape of the glottal flow waveform and the spectrum of the voice source. The high-band excitation is generated by performing a pitch-synchronous timescale (PSTS) transformation on the linear prediction narrow-band residual to generate an high-pass signal that retains the periodic characteristics of the original signal but with a larger open quotient. This method is easy to implement and does not introduce discontinuities in the spectrum of the regenerated excitation. It can be used in applications for BWE where no side information is transmitted or for low bit coding of wide-band speech.

Full Paper

Bibliographic reference.  Cabral, Joao P. / Oliveira, Luis C. (2005): "Pitch-synchronous time-scaling for high-frequency excitation regeneration", In INTERSPEECH-2005, 1513-1516.