S4D: Speaker Diarization Toolkit in Python

Pierre-Alexandre Broux, Florent Desnous, Anthony Larcher, Simon Petitrenaud, Jean Carrive, Sylvain Meignier


In this paper, we present S4D, a new open-source Python toolkit dedicated to speaker diarization. S4D provides various state-of-the-art components and the possibility to easily develop end-to-end diarization prototype systems. S4D offers a large panel of clustering, segmentation, scoring and visualization algorithms. S4D has been thought to be easily understood, installed, modified and used in order to allow fast transfers of diarization technologies to industry and facilitate development of new approaches. Examples, benchmarks on standard tasks and tutorials are provided in this paper. S4D is an extension of the open-source toolkit for speaker recognition: SIDEKIT.


 DOI: 10.21437/Interspeech.2018-1232

Cite as: Broux, P., Desnous, F., Larcher, A., Petitrenaud, S., Carrive, J., Meignier, S. (2018) S4D: Speaker Diarization Toolkit in Python. Proc. Interspeech 2018, 1368-1372, DOI: 10.21437/Interspeech.2018-1232.


@inproceedings{Broux2018,
  author={Pierre-Alexandre Broux and Florent Desnous and Anthony Larcher and Simon Petitrenaud and Jean Carrive and Sylvain Meignier},
  title={S4D: Speaker Diarization Toolkit in Python},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={1368--1372},
  doi={10.21437/Interspeech.2018-1232},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1232}
}