ISCA Archive Interspeech 2013

Improving language identification robustness to highly channel-degraded speech through multiple system fusion

Aaron Lawson, Mitchell McLaren, Yun Lei, Vikramjit Mitra, Nicolas Scheffer, Luciana Ferrer, Martin Graciarena

We describe a language identification system developed for robustness to noise conditions such as those encountered under the DARPA RATS program, which focuses on multi-channel audio collected in high-noise conditions. Work presented here includes novel approaches to scoring iVectors, the introduction of several new acoustic and prosodic features for language identification, and discriminative file selection approaches to score calibration. Further, we explore the use of Discrete Cosine Transforms (DCT) as a supplement to traditional context modeling with Shifted Delta Cepstrum (SDC), and the fusion of multiple iVector systems based on Gaussian backends, neural networks, and adaptive Gaussian backend modeling.
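
For readers unfamiliar with the two context-modeling techniques named in the abstract, the sketch below illustrates the general idea of each. It is not the authors' implementation: the abstract gives no parameters, so the function names (`shifted_delta_cepstra`, `dct_context`), the default settings (`d=1`, `p=3`, `k=7`, `half_width=4`, `n_keep=3`), and the edge handling (clamping to the first or last frame) are all assumptions made for illustration.

```python
import math


def _clamped(cep, t):
    # Assumption: repeat the first/last frame beyond the utterance edges.
    return cep[min(max(t, 0), len(cep) - 1)]


def shifted_delta_cepstra(cep, d=1, p=3, k=7):
    """cep: list of T frames, each a list of N cepstral coefficients.

    For each frame t, stacks k delta blocks c(t + i*p + d) - c(t + i*p - d),
    i = 0..k-1, giving N*k values per frame.
    """
    out = []
    for t in range(len(cep)):
        row = []
        for i in range(k):
            plus = _clamped(cep, t + i * p + d)
            minus = _clamped(cep, t + i * p - d)
            row.extend(a - b for a, b in zip(plus, minus))
        out.append(row)
    return out


def dct_context(cep, half_width=4, n_keep=3):
    """Unnormalized DCT-II over a (2*half_width + 1)-frame window, applied to
    each coefficient's trajectory, keeping the first n_keep terms."""
    length = 2 * half_width + 1
    # DCT-II basis rows: cos(pi * m * (2j + 1) / (2 * length)).
    basis = [
        [math.cos(math.pi * m * (2 * j + 1) / (2 * length)) for j in range(length)]
        for m in range(n_keep)
    ]
    n_coef = len(cep[0])
    out = []
    for t in range(len(cep)):
        row = []
        for n in range(n_coef):
            window = [_clamped(cep, t + j - half_width)[n] for j in range(length)]
            row.extend(sum(b * w for b, w in zip(basis[m], window)) for m in range(n_keep))
        out.append(row)
    return out
```

Both functions turn a sequence of per-frame cepstral vectors into longer vectors that summarize temporal context: SDC by stacking deltas taken at shifted offsets, the DCT variant by compressing each coefficient's trajectory over a window into a few transform terms.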


doi: 10.21437/Interspeech.2013-389

Cite as: Lawson, A., McLaren, M., Lei, Y., Mitra, V., Scheffer, N., Ferrer, L., Graciarena, M. (2013) Improving language identification robustness to highly channel-degraded speech through multiple system fusion. Proc. Interspeech 2013, 1507-1510, doi: 10.21437/Interspeech.2013-389

@inproceedings{lawson13_interspeech,
  author={Aaron Lawson and Mitchell McLaren and Yun Lei and Vikramjit Mitra and Nicolas Scheffer and Luciana Ferrer and Martin Graciarena},
  title={{Improving language identification robustness to highly channel-degraded speech through multiple system fusion}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={1507--1510},
  doi={10.21437/Interspeech.2013-389}
}