The Second DIHARD Challenge: System Description for USC-SAIL Team

Tae Jin Park, Manoj Kumar, Nikolaos Flemotomos, Monisankha Pal, Raghuveer Peri, Rimita Lahiri, Panayiotis Georgiou, Shrikanth Narayanan


In this paper, we describe components that form a part of USC-SAIL team’s submissions to Track 1 and Track 2 of the second DIHARD speaker diarization challenge. We describe each module in our speaker diarization pipeline and explain the rationale behind our choice of algorithms for each module, while comparing the Diarization Error Rate (DER) against different module combinations. We propose a clustering scheme based on spectral clustering that yields competitive performance. Moreover, we introduce an overlap detection scheme and a re-segmentation system for speaker diarization and investigate their performances using controlled and in-the-wild conditions. In addition, we describe the additional components that will be integrated to our speaker diarization system. To pursue the best performance, we compare our system with the state-of-the-art methods that are presented in the previous challenge and literature. We include preliminary results of our speaker diarization system on the evaluation data from the second DIHARD challenge.


 DOI: 10.21437/Interspeech.2019-1903

Cite as: Park, T.J., Kumar, M., Flemotomos, N., Pal, M., Peri, R., Lahiri, R., Georgiou, P., Narayanan, S. (2019) The Second DIHARD Challenge: System Description for USC-SAIL Team. Proc. Interspeech 2019, 998-1002, DOI: 10.21437/Interspeech.2019-1903.


@inproceedings{Park2019,
  author={Tae Jin Park and Manoj Kumar and Nikolaos Flemotomos and Monisankha Pal and Raghuveer Peri and Rimita Lahiri and Panayiotis Georgiou and Shrikanth Narayanan},
  title={{The Second DIHARD Challenge: System Description for USC-SAIL Team}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={998--1002},
  doi={10.21437/Interspeech.2019-1903},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1903}
}