ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Broadcast news speaker tracking for ESTER 2005 campaign

Dan Istrate, Nicolas Scheffer, Corinne Fredouille, Jean-François Bonastre

This paper presents the speaker tracking system of the LIA laboratory, validated during ESTER 2005 campaign on a radio broadcast news corpus of about 90 h. The LIA speaker tracking system firstly uses an acoustic class segmentation in order to suppress non speech frames and to detect the speech conditions. Secondly, a speaker diarization process is applied in order to provide speaker detection system (the last step) with speaker homogeneous segments (boundaries and clustering). The speaker detection system uses UBM/GMM likelihood ratios in order to decide if a segment belongs to one tracked speaker. The speaker tracking system is presented and some results obtained during ESTER 2005 campaign are proposed. The presented systems are based on the ALIZE platform and are available thanks to an open software licence.


doi: 10.21437/Interspeech.2005-652

Cite as: Istrate, D., Scheffer, N., Fredouille, C., Bonastre, J.-F. (2005) Broadcast news speaker tracking for ESTER 2005 campaign. Proc. Interspeech 2005, 2445-2448, doi: 10.21437/Interspeech.2005-652

@inproceedings{istrate05_interspeech,
  author={Dan Istrate and Nicolas Scheffer and Corinne Fredouille and Jean-François Bonastre},
  title={{Broadcast news speaker tracking for ESTER 2005 campaign}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2445--2448},
  doi={10.21437/Interspeech.2005-652}
}