8th Annual Conference of the International Speech Communication Association

Antwerp, Belgium
August 27-31, 2007

Varying Input Segmentation for Story Boundary Detection in English, Arabic and Mandarin Broadcast News

Andrew Rosenberg, Mehrbod Sharifi, Julia Hirschberg

Columbia University, USA

Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent processes such as question answering and information retrieval. In previous work, a decision tree trained on automatically extracted lexical and acoustic features was trained to predict story boundaries, using hypothesized sentence boundaries to define potential story boundaries. In this paper, we empirically evaluate several alternatives to this choice of input segmentation on three languages: English, Mandarin and Arabic. Our results suggest that the best performance can be achieved by using 250ms pause-based segmentation or sentence boundaries determined using a very low confidence score threshold.

Full Paper

Bibliographic reference.  Rosenberg, Andrew / Sharifi, Mehrbod / Hirschberg, Julia (2007): "Varying input segmentation for story boundary detection in English, Arabic and Mandarin broadcast news", In INTERSPEECH-2007, 2589-2592.