Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Detection of Speaker Changes in an Audio Document

Perrine Delacourt, David Kryze, Christian J. Wellekens

Institut EURECOM, Sophia Antipolis, France

This paper addresses the problemof speaker-based segmentation. The aim is to segment the audio data with respect to the speakers. In our study, we assume that no prior information on speakers is available and that people do not speak simultaneously. Our segmentation technique is operated in two passes: first, the most likely speaker changes are detected and then, they are validated or discarded during the second pass. The practical significance of this study is illustrated by applying our technique to synthesized and real data to show its efficiency and to compare its performances with another segmentation technique.

Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Delacourt, Perrine / Kryze, David / Wellekens, Christian J. (1999): "Detection of speaker changes in an audio document", In EUROSPEECH'99, 1195-1198.