ISCA Archive Interspeech 2015

Active learning based data selection for limited resource STT and KWS

Thiago Fraga-Silva, Jean-Luc Gauvain, Lori Lamel, Antoine Laurent, Viet-Bac Le, Abdel Messaoudi

This paper presents first results on using active learning (AL) for training data selection in the context of the IARPA-Babel program. Given an initial training data set, we aim to automatically select additional data (from a pool of untranscribed data) for manual transcription. The initial and selected data are then used to build acoustic and language models for speech recognition. The goal of the AL task is to outperform a baseline system built from a pre-defined data selection of the same size, the Very Limited Language Pack (VLLP) condition. AL methods based on different selection criteria are explored. Compared to the VLLP baseline, improvements are obtained in terms of Word Error Rate and Actual Term Weighted Value for the Lithuanian language. A description of the methods and an analysis of the results are given. The AL selection also outperforms the VLLP baseline for other IARPA-Babel languages, and will be further tested in the upcoming NIST OpenKWS 2015 evaluation.

doi: 10.21437/Interspeech.2015-636

Cite as: Fraga-Silva, T., Gauvain, J.-L., Lamel, L., Laurent, A., Le, V.-B., Messaoudi, A. (2015) Active learning based data selection for limited resource STT and KWS. Proc. Interspeech 2015, 3159-3163, doi: 10.21437/Interspeech.2015-636

  author={Thiago Fraga-Silva and Jean-Luc Gauvain and Lori Lamel and Antoine Laurent and Viet-Bac Le and Abdel Messaoudi},
  title={{Active learning based data selection for limited resource STT and KWS}},
  booktitle={Proc. Interspeech 2015},