ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

Summarization of spontaneous conversations

Xiaodan Zhu, Gerald Penn

Most speech summarization research is conducted on broadcast news. In our viewpoint, spontaneous conversations are a more "typical" speech source that distinguishes speech summarization from text summarization, and hence a more appropriate domain for studying speech summarization. For example, spontaneous conversations contain more spoken-language characteristics, e.g. disfluencies and false starts. They are also more vulnerable to ASR errors. Previous research has studied some aspects of this type of data, but this paper addresses the problem further in several important respects. First, we summarize spontaneous conversations with features of a wide variety that have not been explored before. Second, we examine the role of disfluencies in summarization, which in all previous work was either not explicitly handled or removed as noise. Third, we breakdown and analyze the impact of WER on the individual features for summarization.

doi: 10.21437/Interspeech.2006-430

Cite as: Zhu, X., Penn, G. (2006) Summarization of spontaneous conversations. Proc. Interspeech 2006, paper 1899-Wed1WeS.2, doi: 10.21437/Interspeech.2006-430

  author={Xiaodan Zhu and Gerald Penn},
  title={{Summarization of spontaneous conversations}},
  booktitle={Proc. Interspeech 2006},
  pages={paper 1899-Wed1WeS.2},