ISCA Archive Interspeech 2013

NMF-based temporal feature integration for acoustic event classification

Jimmy Ludeña-Choez, Ascensión Gallardo-Antolín

In this paper, we propose a new front-end for Acoustic Event Classification (AEC) tasks that combines the temporal feature integration technique known as Filter Bank Coefficients (FC) with Non-Negative Matrix Factorization (NMF). FC captures the dynamic structure of the short-term features by summarizing the periodogram of each short-term feature dimension in several frequency bands using a predefined filter bank. Because the commonly used filter bank was devised for other tasks (such as music genre classification), it can be suboptimal for AEC. To overcome this drawback, we propose an unsupervised NMF-based method for learning the filters that collect the most relevant temporal information in the short-term features for AEC. Experiments show that the features obtained with this method significantly improve the classification performance of a Support Vector Machine (SVM) based AEC system compared with the baseline FC features.
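
The two steps described in the abstract, summarizing per-dimension periodograms with a filter bank and learning that filter bank with NMF, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the window length, hop, and number of filters are hypothetical, and the multiplicative-update NMF with a Euclidean cost is an assumption, since the abstract does not specify the NMF variant or how the learned filters are applied.

```python
import numpy as np

def periodograms(feats, win=32, hop=16):
    """Periodogram of each feature trajectory over sliding windows.

    feats: (n_frames, n_dims) short-term features (e.g. MFCCs).
    Returns an array of shape (n_windows, n_dims, win // 2 + 1).
    win and hop are illustrative values, not taken from the paper.
    """
    starts = range(0, feats.shape[0] - win + 1, hop)
    segs = np.stack([feats[s:s + win] for s in starts])  # (n_windows, win, n_dims)
    spec = np.fft.rfft(segs, axis=1)                      # FFT along the time axis
    return (np.abs(spec) ** 2 / win).transpose(0, 2, 1)

def integrate(pgrams, filters):
    """Summarize each periodogram with a filter bank of shape (n_filters, n_bins)."""
    out = pgrams @ filters.T                              # (n_windows, n_dims, n_filters)
    return out.reshape(out.shape[0], -1)                  # one vector per window

def learn_filters_nmf(pgrams, n_filters=4, n_iter=200, seed=0, eps=1e-9):
    """Learn non-negative filters from periodograms (assumed Euclidean-cost NMF)."""
    V = pgrams.reshape(-1, pgrams.shape[-1]).T            # (n_bins, n_samples)
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], n_filters)) + eps         # basis vectors = filters
    H = rng.random((n_filters, V.shape[1])) + eps         # activations
    for _ in range(n_iter):
        # Multiplicative updates keep W and H non-negative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W.T                                            # (n_filters, n_bins)

# Usage on synthetic data standing in for real short-term features.
rng = np.random.default_rng(0)
feats = rng.standard_normal((200, 13))
P = periodograms(feats)
filters = learn_filters_nmf(P)
X = integrate(P, filters)
```

In this sketch the baseline FC front-end would correspond to calling `integrate` with a fixed, hand-designed `filters` matrix, while the NMF variant replaces that matrix with one learned from the data. The resulting vectors in `X` are what a classifier such as an SVM would consume.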


doi: 10.21437/Interspeech.2013-653

Cite as: Ludeña-Choez, J., Gallardo-Antolín, A. (2013) NMF-based temporal feature integration for acoustic event classification. Proc. Interspeech 2013, 2924-2928, doi: 10.21437/Interspeech.2013-653

@inproceedings{ludenachoez13_interspeech,
  author={Jimmy Ludeña-Choez and Ascensión Gallardo-Antolín},
  title={{NMF-based temporal feature integration for acoustic event classification}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2924--2928},
  doi={10.21437/Interspeech.2013-653}
}