ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing

ICC Jeju, Korea
October 3, 2004

PLP-Squared: Autoregressive Modeling of Auditory-Like 2-D Spectro-Temporal Patterns

Marios Athineos (1), Hynek Hermansky (2), Daniel P. W. Ellis (1)

(1) LabROSA, Dept. of Electrical Engineering, Columbia University, New York, NY, USA
(2) IDIAP Research Institute, Martigny, Switzerland

The temporal trajectories of the spectral energy in auditory critical bands over 250 ms segments are approximated by an all-pole model, the time-domain dual of conventional linear prediction. This quarter-second auditory spectro-temporal pattern is further smoothed by iterative alternation of spectral and temporal all-pole modeling. Just as Perceptual Linear Prediction (PLP) uses an autoregressive model in the frequency domain to estimate peaks in an auditory-like short-term spectral slice, PLP2 uses all-pole modeling in both time and frequency domains to estimate peaks of a two-dimensional spectrotemporal pattern, motivated by considerations of the auditory system.


Full Paper

Bibliographic reference.  Athineos, Marios / Hermansky, Hynek / Ellis, Daniel P. W. (2004): "PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns", In SAPA-2004, paper 129.