The temporal trajectories of the spectral energy in auditory critical bands over 250 ms segments are approximated by an all-pole model, the time-domain dual of conventional linear prediction. This quarter-second auditory spectro-temporal pattern is further smoothed by iterative alternation of spectral and temporal all-pole modeling. Just as Perceptual Linear Prediction (PLP) uses an autoregressive model in the frequency domain to estimate peaks in an auditory-like short-term spectral slice, PLP2 uses all-pole modeling in both time and frequency domains to estimate peaks of a two-dimensional spectrotemporal pattern, motivated by considerations of the auditory system.
Cite as: Athineos, M., Hermansky, H., Ellis, D.P.W. (2004) PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns. Proc. ITRW on Statistical and Perceptual Audio Processing (SAPA 2004), paper 129
@inproceedings{athineos04_sapa, author={Marios Athineos and Hynek Hermansky and Daniel P. W. Ellis}, title={{PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns}}, year=2004, booktitle={Proc. ITRW on Statistical and Perceptual Audio Processing (SAPA 2004)}, pages={paper 129} }