Cenatav Voice Group System for Albayzin 2018 Search on Speech Evaluation

Ana R. Montalvo, Jose M. Ramirez, Alejandro Roble, Jose R. Calvo


This paper presents the system submitted by the Voice Group of CENATAV to the Albayzin 2018 "Search on Speech" Evaluation. The system used in the Spoken Term Detection (STD) task consists of an Automatic Speech Recognizer (ASR) and a term detection module. Both modules are built with the open-source Kaldi toolkit. The ASR acoustic models are based on DNN-HMM, S-GMM or GMM-HMM, and are trained on audio data provided by the organizers together with additional data obtained from ELDA. The lexicon and the trigram language model are derived from the transcriptions associated with the audio. The ASR generates the lattices and word alignments required to detect the terms. Results on development data show that the DNN-HMM model performs better than, or similarly to, the results obtained in previous challenges.
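As a rough illustration of the term detection step, the sketch below searches time-aligned recognizer output for query terms. It is not the authors' Kaldi-based module, which works on lattices. It assumes a simplified 1-best word alignment with per-word confidences, and the names `Word`, `Hit`, `detect_terms` and the `threshold` value are hypothetical.

```python
from typing import List, NamedTuple


class Word(NamedTuple):
    """One recognized word with timing (hypothetical, CTM-like record)."""
    utt: str      # utterance or file identifier
    start: float  # start time in seconds
    dur: float    # duration in seconds
    text: str     # recognized word
    conf: float   # word confidence in [0, 1]


class Hit(NamedTuple):
    """One detected occurrence of a query term."""
    term: str
    utt: str
    start: float
    end: float
    score: float


def detect_terms(words: List[Word], terms: List[str],
                 threshold: float = 0.5) -> List[Hit]:
    """Find single- or multi-word terms in a time-aligned word sequence.

    Assumption: a detection is scored by the lowest word confidence in the
    match, and kept only if that score reaches `threshold`.
    """
    hits: List[Hit] = []
    for term in terms:
        tokens = term.lower().split()
        if not tokens:
            continue  # skip empty queries
        n = len(tokens)
        for i in range(len(words) - n + 1):
            window = words[i:i + n]
            # A match must not span two different utterances.
            if any(w.utt != window[0].utt for w in window):
                continue
            if [w.text.lower() for w in window] != tokens:
                continue
            score = min(w.conf for w in window)
            if score >= threshold:
                last = window[-1]
                hits.append(Hit(term, window[0].utt, window[0].start,
                                last.start + last.dur, score))
    return hits


# Toy alignment, invented for illustration only.
alignment = [
    Word("utt1", 0.0, 0.3, "buenos", 0.9),
    Word("utt1", 0.3, 0.4, "dias", 0.8),
    Word("utt1", 0.7, 0.5, "madrid", 0.4),
]
for hit in detect_terms(alignment, ["buenos dias", "madrid"]):
    print(hit)
```

With the toy data, only "buenos dias" is reported, since the confidence of "madrid" falls below the assumed threshold. Searching lattices instead of the 1-best output would also recover terms that appear only in lower-ranked hypotheses.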


DOI: 10.21437/IberSPEECH.2018-53

Cite as: Montalvo, A.R., Ramirez, J.M., Roble, A., Calvo, J.R. (2018) Cenatav Voice Group System for Albayzin 2018 Search on Speech Evaluation. Proc. IberSPEECH 2018, 254-256, DOI: 10.21437/IberSPEECH.2018-53.


@inproceedings{Montalvo2018,
  author={Ana R. Montalvo and Jose M. Ramirez and Alejandro Roble and Jose R. Calvo},
  title={{Cenatav Voice Group System for Albayzin 2018 Search on Speech Evaluation}},
  year=2018,
  booktitle={Proc. IberSPEECH 2018},
  pages={254--256},
  doi={10.21437/IberSPEECH.2018-53},
  url={http://dx.doi.org/10.21437/IberSPEECH.2018-53}
}