Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Morphing Spectral Envelopes Using Audio Flow

Tony Ezzat, Ethan Meyers, James Glass, Tomaso Poggio

Massachusetts Institute of Technology, USA

We present a method for morphing between smooth spectral magnitude envelopes of speech. An important element of our method is the notion of audio flow, which is inspired by similar notions of optical flow computed between images in computer vision applications. Audio flow defines the correspondence between two smooth spectral magnitude envelopes, and encodes the formant shifting that occurs from one sound to another. We present several algorithms for the automatic computation of audio flow from a small 20 second corpus of speech. In addition, we present an algorithm for morphing smoothly between any two spectral magnitude envelopes, given the computed audio flow between them.

Full Paper

Bibliographic reference.  Ezzat, Tony / Meyers, Ethan / Glass, James / Poggio, Tomaso (2005): "Morphing spectral envelopes using audio flow", In INTERSPEECH-2005, 2545-2548.