ODESSA/PLUMCOT at Albayzin Multimodal Diarization Challenge 2018

Benjamin Maurice, Hervé Bredin, Ruiqing Yin, Jose Patino, Héctor Delgado, Claude Barras, Nicholas Evans, Camille Guinaudeau


This paper describes ODESSA and PLUMCOT submissions to Albayzin Multimodal Diarization Challenge 2018. Given a list of people to recognize (alongside image and short video samples of those people), the task consists in jointly answering the two questions “who speaks when?” and “who appears when?”. Both consortia submitted 3 runs (1 primary and 2 contrastive) based on the same underlying mono-modal neural technologies : neural speaker segmentation, neural speaker embeddings, neural face embeddings, and neural talking-face detection. Our submissions aim at showing that face clustering and recognition can (hopefully) help to improve speaker diarization.


 DOI: 10.21437/IberSPEECH.2018-39

Cite as: Maurice, B., Bredin, H., Yin, R., Patino, J., Delgado, H., Barras, C., Evans, N., Guinaudeau, C. (2018) ODESSA/PLUMCOT at Albayzin Multimodal Diarization Challenge 2018. Proc. IberSPEECH 2018, 194-198, DOI: 10.21437/IberSPEECH.2018-39.


@inproceedings{Maurice2018,
  author={Benjamin Maurice and Hervé Bredin and Ruiqing Yin and Jose Patino and Héctor Delgado and Claude Barras and Nicholas Evans and Camille Guinaudeau},
  title={{ODESSA/PLUMCOT at Albayzin Multimodal Diarization Challenge 2018}},
  year=2018,
  booktitle={Proc. IberSPEECH 2018},
  pages={194--198},
  doi={10.21437/IberSPEECH.2018-39},
  url={http://dx.doi.org/10.21437/IberSPEECH.2018-39}
}