8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Subtopic Segmentation in the Lecture Speech

Noboru Kanedera (1), Sumida Asuka (1), Takao Ikehata (1), Tetsuo Funada (2)

(1) Ishikawa National College of Technology, Japan
(2) Kanazawa University, Japan

This paper proposes a method of segmentation that segments lecture video material into subtopics based on speech signals for creation of educational video contents. To represent subtopics of video segments, the text recognized by automatic speech recognition (ASR) from a lecture speech was converted into an index using independent component analysis (ICA) instead of conventional TF-IDF. This research attempted a method of segmentation using dynamic programming that minimizes the sum of cosine measures between adjacent indexes. The validity of the proposed method was evaluated using sample lecture videos. Results indicated that subtopic segmentation using automatic speech recognition performed as well as that using transcription text.

Full Paper

Bibliographic reference.  Kanedera, Noboru / Asuka, Sumida / Ikehata, Takao / Funada, Tetsuo (2004): "Subtopic segmentation in the lecture speech", In INTERSPEECH-2004, 1821-1824.