ISCA Archive SAPA 2004
ISCA Archive SAPA 2004

PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns

Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis

The temporal trajectories of the spectral energy in auditory critical bands over 250 ms segments are approximated by an all-pole model, the time-domain dual of conventional linear prediction. This quarter-second auditory spectro-temporal pattern is further smoothed by iterative alternation of spectral and temporal all-pole modeling. Just as Perceptual Linear Prediction (PLP) uses an autoregressive model in the frequency domain to estimate peaks in an auditory-like short-term spectral slice, PLP2 uses all-pole modeling in both time and frequency domains to estimate peaks of a two-dimensional spectrotemporal pattern, motivated by considerations of the auditory system.


Cite as: Athineos, M., Hermansky, H., Ellis, D.P.W. (2004) PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns. Proc. ITRW on Statistical and Perceptual Audio Processing (SAPA 2004), paper 129

@inproceedings{athineos04_sapa,
  author={Marios Athineos and Hynek Hermansky and Daniel P. W. Ellis},
  title={{PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns}},
  year=2004,
  booktitle={Proc. ITRW on Statistical and Perceptual Audio Processing (SAPA 2004)},
  pages={paper 129}
}