BUT System for DIHARD Speech Diarization Challenge 2018

Mireia Diez, Federico Landini, Lukáš Burget, Johan Rohdin, Anna Silnova, Kateřina Žmolíková, Ondřej Novotný, Karel Veselý, Ondřej Glembek, Oldřich Plchot, Ladislav Mošner, Pavel Matějka


This paper presents the approach developed by the BUT team for the first DIHARD speech diarization challenge, which is based on our Bayesian Hidden Markov Model with eigenvoice priors system. Besides the description of the approach, we provide a brief analysis of different techniques and data processing methods tested on the development set. We also introduce a simple attempt for overlapped speech detection that we used for attaining cleaner speaker models and reassigning overlapped speech to multiple speakers. Finally, we present results obtained on the evaluation set and discuss findings we made during the development phase and with the help of the DIHARD leaderboard feedback.


 DOI: 10.21437/Interspeech.2018-1749

Cite as: Diez, M., Landini, F., Burget, L., Rohdin, J., Silnova, A., Žmolíková, K., Novotný, O., Veselý, K., Glembek, O., Plchot, O., Mošner, L., Matějka, P. (2018) BUT System for DIHARD Speech Diarization Challenge 2018. Proc. Interspeech 2018, 2798-2802, DOI: 10.21437/Interspeech.2018-1749.


@inproceedings{Diez2018,
  author={Mireia Diez and Federico Landini and Lukáš Burget and Johan Rohdin and Anna Silnova and Kateřina Žmolíková and Ondřej Novotný and Karel Veselý and Ondřej Glembek and Oldřich Plchot and Ladislav Mošner and Pavel Matějka},
  title={BUT System for DIHARD Speech Diarization Challenge 2018},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={2798--2802},
  doi={10.21437/Interspeech.2018-1749},
  url={http://dx.doi.org/10.21437/Interspeech.2018-1749}
}