ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Training audio events detectors with a sound effects corpus

Isabel Trancoso, Jose Portelo, Miguel Bugalho, João Neto, Antonio Serralheiro

This paper describes the work done in the framework of the VIDIVIDEO European project in terms of audio event detection. Our first experiments concerned the detection of non-voice sounds, such as birds, machines, traffic, water and steps. Given the unavailability of a corpus labelled in terms of audio events, we used a relatively small sound effect corpus for training. Our initial experiments with one-against-all SVM classifiers for these 5 classes showed us the feasibility of using this type of data for training, thus avoiding the extremely morose task of manual labelling of a very high number of audio events. Preliminary integration experiments are quite promising.


doi: 10.21437/Interspeech.2008-631

Cite as: Trancoso, I., Portelo, J., Bugalho, M., Neto, J., Serralheiro, A. (2008) Training audio events detectors with a sound effects corpus. Proc. Interspeech 2008, 2546-2549, doi: 10.21437/Interspeech.2008-631

@inproceedings{trancoso08_interspeech,
  author={Isabel Trancoso and Jose Portelo and Miguel Bugalho and João Neto and Antonio Serralheiro},
  title={{Training audio events detectors with a sound effects corpus}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={2546--2549},
  doi={10.21437/Interspeech.2008-631}
}