EUROSPEECH 2003 - INTERSPEECH 2003
In this paper, we propose a sequential metric-based audio segmentation method that has the advantage of low computation cost of metric-based methods and the advantage of high accuracy of model-selection-based methods. There are two major differences between our method and the conventional metric-based methods:(1) Each changing point has multiple chances to be detected by different pairs of windows, rather than only once by its neighboring acoustic information.(2) By introducing the Bayesian Information Criterion(BIC) into the distance computation of two windows, we can deal with the thresholding issue more easily. We used five one-hour broadcast news shows for experiments, and the experimental results show that our method performs as well as the model-selection-based methods, but with a lower computation cost.
Bibliographic reference. Cheng, Shi-sian / Wang, Hsin-Min (2003): "A sequential metric-based audio segmentation method via the Bayesian information criterion", In EUROSPEECH-2003, 945-948.