ISCA Archive Interspeech 2009

An adaptive BIC approach for robust audio stream segmentation

Janez Žibert, Andrej Brodnik, France Mihelič

This paper addresses audio segmentation. We present a novel method for robust estimation of decision thresholds for accurate detection of acoustic change points in continuous audio streams. In standard segmentation procedures, the decision thresholds are usually set in advance and need to be tuned on development data. In the presented approach, we aim to remove the need for pre-determined decision thresholds and propose a method that estimates the thresholds directly from the audio data currently being processed. It employs change-detection methods from two well-established audio segmentation approaches based on the Bayesian Information Criterion (BIC). Building on this, we develop two audio segmentation procedures that enable us to adaptively tune boundary-detection thresholds and to combine different audio representations in the segmentation process. The proposed segmentation procedures are tested on broadcast news audio data.
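For readers unfamiliar with BIC-based change detection, the following is a minimal sketch of the commonly used ΔBIC test for a single candidate boundary, assuming each side of the boundary is modelled by one full-covariance Gaussian. It is not the adaptive procedure proposed in the paper. The function name `delta_bic` and the parameter `penalty_weight` are hypothetical names chosen for illustration; in the standard formulation, the penalty weight is the kind of pre-set value that typically has to be tuned on development data.

```python
import numpy as np


def delta_bic(frames, split, penalty_weight=1.0):
    """Standard Delta-BIC score for a candidate change point (illustrative sketch).

    frames: array of shape (n_frames, n_dims), e.g. cepstral feature vectors.
    split: index of the candidate boundary, with 1 < split < n_frames - 1.
    penalty_weight: hypothetical name for the tunable penalty factor.
    A positive score favours two separate Gaussians, i.e. a change point.
    """
    n, d = frames.shape
    left, right = frames[:split], frames[split:]

    def logdet_cov(x):
        # Maximum-likelihood covariance estimate (bias=True divides by n).
        cov = np.atleast_2d(np.cov(x, rowvar=False, bias=True))
        _, logdet = np.linalg.slogdet(cov)
        return logdet

    # Log-likelihood gain of modelling the window with two Gaussians instead of one.
    gain = 0.5 * (
        n * logdet_cov(frames)
        - len(left) * logdet_cov(left)
        - len(right) * logdet_cov(right)
    )
    # Penalty for the extra mean vector and covariance matrix of the second model.
    n_params = d + d * (d + 1) / 2
    penalty = 0.5 * penalty_weight * n_params * np.log(n)
    return gain - penalty


# Synthetic example: the second half of the window has a shifted mean.
rng = np.random.default_rng(0)
changed = np.vstack([rng.normal(0.0, 1.0, (200, 3)), rng.normal(5.0, 1.0, (200, 3))])
homogeneous = rng.normal(0.0, 1.0, (400, 3))
score_changed = delta_bic(changed, 200)
score_homogeneous = delta_bic(homogeneous, 200)
```

In this sketch a boundary is hypothesised wherever the score exceeds zero, so the outcome depends directly on the chosen penalty weight.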

doi: 10.21437/Interspeech.2009-669

Cite as: Žibert, J., Brodnik, A., Mihelič, F. (2009) An adaptive BIC approach for robust audio stream segmentation. Proc. Interspeech 2009, 2539-2542, doi: 10.21437/Interspeech.2009-669

  author={Janez Žibert and Andrej Brodnik and France Mihelič},
  title={{An adaptive BIC approach for robust audio stream segmentation}},
  booktitle={Proc. Interspeech 2009},