CENATAV Voice-Group Systems for Albayzin 2018 Speaker Diarization Evaluation Campaign

Edward L. Campbell, Gabriel Hernandez, José R. Calvo de Lara


Usually, the environment to record a voice signal is not ideal and, in order to improve the representation of the speaker characteristic space, it is necessary to use a robust algorithm, thus making the representation more stable in the presence of noise. A Diarization system that focuses on the use of robust feature extraction techniques is proposed in this paper. The presented features ( such as Mean Hilbert Envelope Coefficients, Medium Duration Modulation Coefficients and Power Normalization Cepstral Coefficients ) were not used in other Albayzin Challenges. These robust techniques have a common characteristic, which is the use of a Gammatone filter-bank for dividing the voice signal in sub-bands as an alternative option to the classical Triangular filter-bank used in Mel Frequency Cepstral Coefficients. The experiment results show a more stable Diarization Error Rate in robust features than in classic features.


 DOI: 10.21437/IberSPEECH.2018-47

Cite as: Campbell, E.L., Hernandez, G., Calvo de Lara, J.R. (2018) CENATAV Voice-Group Systems for Albayzin 2018 Speaker Diarization Evaluation Campaign. Proc. IberSPEECH 2018, 227-230, DOI: 10.21437/IberSPEECH.2018-47.


@inproceedings{Campbell2018,
  author={Edward L. Campbell and Gabriel Hernandez and José R. {Calvo de Lara}},
  title={{CENATAV Voice-Group Systems for Albayzin 2018 Speaker Diarization Evaluation Campaign}},
  year=2018,
  booktitle={Proc. IberSPEECH 2018},
  pages={227--230},
  doi={10.21437/IberSPEECH.2018-47},
  url={http://dx.doi.org/10.21437/IberSPEECH.2018-47}
}