ISCA Archive Odyssey 2004

ELISA NIST RT03 broadcast news speaker diarization experiments

Daniel Moraru, Sylvain Meignier, Corinne Fredouille, Laurent Besacier, Jean-François Bonastre

This paper presents the ELISA consortium's work on automatic speaker diarization (also known as speaker segmentation) during the NIST Rich Transcription (RT) 2003 evaluation. The experiments were carried out on real broadcast news data (HUB4). The paper first shows the value of segmentation into acoustic macro-classes (such as gender or bandwidth) as front-end processing for the segmentation/diarization task. The impact of this prior acoustic segmentation is evaluated in terms of speaker diarization performance. Second, two different approaches from the CLIPS and LIA laboratories are presented, and different ways of combining them are investigated. The system submitted as the ELISA primary system obtained the second-lowest diarization error rate among the primary systems of the RT03 participants. Another ELISA system, submitted as secondary, outperformed the best primary system (i.e., it obtained the lowest speaker diarization error rate).


Cite as: Moraru, D., Meignier, S., Fredouille, C., Besacier, L., Bonastre, J.-F. (2004) ELISA NIST RT03 broadcast news speaker diarization experiments. Proc. The Speaker and Language Recognition Workshop (Odyssey 2004), 23-28

@inproceedings{moraru04_odyssey,
  author={Daniel Moraru and Sylvain Meignier and Corinne Fredouille and Laurent Besacier and Jean-François Bonastre},
  title={{ELISA NIST RT03 broadcast news speaker diarization experiments}},
  year=2004,
  booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2004)},
  pages={23--28}
}