We present a novel approach for recognition of overlapping sound events based on the Generalised Hough Transform (GHT), a technique commonly used for object recognition in the domain of image processing. Unlike our previous work on image-based sound event classification, where we focussed on global image features, here we extract local features from detected interest-points in the spectrogram. Each local feature forms a robust representation of the region around its interest-point, and when the information from all interest-points in the spectrogram is combined using the GHT, we can form hypotheses for the location of one or more overlapping sound events in the image. Our experiments show promising results, and demonstrate the ability of our approach to recognise overlapping sounds.
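The voting idea behind the GHT can be illustrated with a minimal Python sketch. It is not the implementation used in the paper: the function names `ght_vote` and `find_hypotheses`, the integer codebook labels, the offset values, and the vote threshold are all hypothetical, and the sketch assumes that interest-point detection and codebook matching have already been done. Each interest-point casts votes for where the reference point of a sound event would lie, and peaks in the accumulator become location hypotheses.

```python
import numpy as np


def ght_vote(points, codebook, shape):
    """Accumulate Hough votes for the reference point of one sound class.

    points:   iterable of (t, f, code) interest-points in the spectrogram,
              where code is the index of the matched codebook entry
    codebook: dict mapping code -> list of (dt, df) offsets from an
              interest-point to the class reference point
              (hypothetical output of a training stage)
    shape:    (n_frames, n_bins) of the spectrogram
    """
    acc = np.zeros(shape, dtype=float)
    for t, f, code in points:
        for dt, df in codebook.get(code, []):
            rt, rf = t + dt, f + df
            # Discard votes that fall outside the spectrogram.
            if 0 <= rt < shape[0] and 0 <= rf < shape[1]:
                acc[rt, rf] += 1.0
    return acc


def find_hypotheses(acc, min_votes):
    """Return (t, f, votes) for cells that reach min_votes and are the
    maximum of their 3x3 neighbourhood (tied neighbours are all kept)."""
    hyps = []
    for t, f in zip(*np.nonzero(acc >= min_votes)):
        window = acc[max(t - 1, 0):t + 2, max(f - 1, 0):f + 2]
        if acc[t, f] == window.max():
            hyps.append((int(t), int(f), float(acc[t, f])))
    return hyps


# Toy example: one class whose three codebook entries each store a single
# offset back to the reference point (assumed values).
codebook = {0: [(-2, -1)], 1: [(-4, -3)], 2: [(-1, -5)]}

# Two instances of the event that overlap in time, with reference points
# at (3, 2) and (5, 2).
points = [(5, 3, 0), (7, 5, 1), (4, 7, 2),
          (7, 3, 0), (9, 5, 1), (6, 7, 2)]

acc = ght_vote(points, codebook, shape=(12, 10))
print(find_hypotheses(acc, min_votes=3))
# prints [(3, 2, 3.0), (5, 2, 3.0)]
```

In this toy case both instances are recovered because the votes of each instance agree on a single cell even though the interest-points interleave in time. A system handling several sound classes would presumably keep one accumulator per class, which is again an assumption of this sketch rather than a detail stated above.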
Index Terms: overlapping, sound events, recognition
Bibliographic reference. Dennis, Jonathan / Tran, Huy Dat / Chng, Eng Siong (2012): "Overlapping sound event recognition using local spectrogram features with the generalised Hough transform", In INTERSPEECH-2012, 2266-2269.