INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Speaker Segmentation and Clustering in Meetings

Qin Jin, Tanja Schultz

Carnegie Mellon University, USA

This paper describes the automatic speaker segmentation and clustering system for natural, multi-speaker meeting conversations based on multiple distant microphones. The system was evaluated in the NIST RT-04S Meeting Recognition Evaluation on the speaker diarization task and achieved speaker diarization performance of 28.17%. This system also aims to provide automatic speech segments and speaker grouping information for speech recognition, a necessary prerequisite for subsequent audio processing. A 44.5% word error rate was achieved for speech recognition.

Full Paper

Bibliographic reference.  Jin, Qin / Schultz, Tanja (2004): "Speaker segmentation and clustering in meetings", In INTERSPEECH-2004, 597-600.