INTERSPEECH 2006 - ICSLP
We present a method that analyzes a two-dimensional magnitude spectrogram S(f, t) into its local constituent spectro-temporal amplitudes A(f, t), frequencies F(f, t), orientations Θ(f, t), and phases φ(f, t). The method operates by performing a two-dimensional local Gabor-like analysis of the spectrogram, retaining only the parameters of the 2D-Gabor filter with maximal amplitude response within the local region. We demonstrate the technique over a wide variety of speakers, and show how the spectrograms in each case may be adequately reconstructed using the parameters of the Max-Gabor analysis. Finally, we discuss the nature of the extracted Max-Gabor parameters.
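As a rough illustration of the "keep only the maximal filter" idea, the sketch below analyzes a single local patch with a small bank of complex 2D Gabor filters and returns the amplitude, frequency, orientation, and phase of the strongest response. It is a minimal sketch under stated assumptions, not the authors' implementation: the kernel form (isotropic Gaussian window times a complex plane-wave carrier), the window width `sigma`, the normalization by the window sum, the frequency and orientation grids, and the names `gabor_response` and `max_gabor` are all hypothetical choices made here for illustration.

```python
import cmath
import math


def gabor_response(patch, freq, theta, sigma):
    """Complex response of one 2D Gabor filter centered on a square patch.

    Assumed kernel (not taken from the paper): an isotropic Gaussian window
    multiplied by a complex plane wave with spatial frequency `freq`
    (cycles per bin) and orientation `theta` (radians).
    """
    half = len(patch) // 2
    kx = freq * math.cos(theta)  # carrier component along the column axis
    ky = freq * math.sin(theta)  # carrier component along the row axis
    acc = 0j
    norm = 0.0
    for row, values in enumerate(patch):
        for col, value in enumerate(values):
            x, y = col - half, row - half
            window = math.exp(-(x * x + y * y) / (2.0 * sigma * sigma))
            # Correlating with the conjugate carrier makes the phase of the
            # response track the phase of the underlying ripple.
            acc += value * window * cmath.exp(-2j * math.pi * (kx * x + ky * y))
            norm += window
    # Normalizing by the window sum is an illustrative choice.
    return acc / norm


def max_gabor(patch, freqs, thetas, sigma=4.0):
    """Return (A, F, theta, phi) of the filter with the largest amplitude."""
    best = None
    for freq in freqs:
        for theta in thetas:
            response = gabor_response(patch, freq, theta, sigma)
            if best is None or abs(response) > best[0]:
                best = (abs(response), freq, theta, cmath.phase(response))
    return best


# Synthetic test patch: a single oriented ripple with known parameters.
half = 8
f0, theta0, phi0 = 0.125, math.pi / 4, 0.5
patch = [
    [
        math.cos(2 * math.pi * f0 * (x * math.cos(theta0) + y * math.sin(theta0)) + phi0)
        for x in range(-half, half + 1)
    ]
    for y in range(-half, half + 1)
]

freqs = [0.0625, 0.125, 0.25]
thetas = [0.0, math.pi / 4, math.pi / 2, 3 * math.pi / 4]
amplitude, frequency, orientation, phase = max_gabor(patch, freqs, thetas)
print(amplitude, frequency, orientation, phase)
```

Applied to a full spectrogram, the same selection would be repeated for a patch centered on each (f, t) location, yielding the four parameter maps described in the abstract. Orientations are restricted to [0, π) here because a real-valued ripple at θ and θ + π is the same pattern.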
Bibliographic reference. Ezzat, Tony / Bouvrie, Jake / Poggio, Tomaso (2006): "Max-Gabor analysis and synthesis of spectrograms", In INTERSPEECH-2006, paper 1561-Thu2BuP.5.