Within-Speaker Features for Native Language Recognition in the Interspeech 2016 Computational Paralinguistics Challenge

Mark Huckvale


The Interspeech 2016 Native Language recognition challenge was to identify the first language of 867 speakers from their spoken English. Effectively this was an L2 accent recognition task where the L1 was one of eleven languages. The lack of transcripts of the spontaneous speech recordings meant that the currently best performing accent recognition approach (ACCDIST) developed by the author could not be applied. Instead, the objectives of this study were to explore whether within-speaker features found to be effective in ACCDIST would also have value within a contemporary GMM-based accent recognition approach. We show that while Gaussian mean supervectors provide the best performance on this task, small gains may be had by fusing the mean supervector system with a system based on within-speaker Gaussian mixture distances.


DOI: 10.21437/Interspeech.2016-1466

Cite as

Huckvale, M. (2016) Within-Speaker Features for Native Language Recognition in the Interspeech 2016 Computational Paralinguistics Challenge. Proc. Interspeech 2016, 2403-2407.

Bibtex
@inproceedings{Huckvale2016,
author={Mark Huckvale},
title={Within-Speaker Features for Native Language Recognition in the Interspeech 2016 Computational Paralinguistics Challenge},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-1466},
url={http://dx.doi.org/10.21437/Interspeech.2016-1466},
pages={2403--2407}
}