Determining Native Language and Deception Using Phonetic Features and Classifier Combination

Gábor Gosztolya, Tamás Grósz, Róbert Busa-Fekete, László Tóth


For several years, the Interspeech ComParE Challenge has focused on paralinguistic tasks of various kinds. In this paper we focus on the Native Language and the Deception sub-challenges of ComParE 2016, where the goal is to identify the native language of the speaker, and to recognize deceptive speech. As both tasks can be treated as classification ones, we experiment with several state-of-the-art machine learning methods (Support-Vector Machines, AdaBoost. MH and Deep Neural Networks), and also test a simple-yet-robust combination method. Furthermore, we will assume that the native language of the speaker affects the pronunciation of specific phonemes in the language he is currently using. To exploit this, we extract phonetic features for the Native Language task. Moreover, for the Deception Sub-Challenge we compensate for the highly unbalanced class distribution by instance re-sampling. With these techniques we are able to significantly outperform the baseline SVM on the unpublished test set.


DOI: 10.21437/Interspeech.2016-962

Cite as

Gosztolya, G., Grósz, T., Busa-Fekete, R., Tóth, L. (2016) Determining Native Language and Deception Using Phonetic Features and Classifier Combination. Proc. Interspeech 2016, 2418-2422.

Bibtex
@inproceedings{Gosztolya+2016,
author={Gábor Gosztolya and Tamás Grósz and Róbert Busa-Fekete and László Tóth},
title={Determining Native Language and Deception Using Phonetic Features and Classifier Combination},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-962},
url={http://dx.doi.org/10.21437/Interspeech.2016-962},
pages={2418--2422}
}