ISCA Archive Odyssey 2008
ISCA Archive Odyssey 2008

Detecting nonnative speech using speaker recognition approaches

Elizabeth Shriberg, Luciana Ferrer, Sachin Kajarekar, Nicolas Scheffer, Andreas Stolcke, Murat Akbacak

Detecting whether a talker is speaking his native language is useful for speaker recognition, speech recognition, and intelligence applications. We study the problem of detecting nonnative speakers of American English, using two standard speech corpora. We apply approaches effective in speaker verification to this task, including systems based on MLLR, phone N-gram, prosodic, and word Ngram features. Results show equal error rates between 12% and 20%, depending on the system, test data, and choice of training data. Asymmetries in performance are most likely explained by differences in native language distributions in the corpora. Model combination yields substantial improvements over individual models, with the best result being around 8.6% EER. While phone Ngrams are widely used in related tasks (e.g., language and dialect ID), we find that it is the least effective model in combination; MLLR, prosody, and word N-gram systems play stronger roles. Overall, results suggest that individual systems and system combinations found useful for speaker ID also offer promise for nonnativeness detection, and that further efforts are warranted in this area.

Cite as: Shriberg, E., Ferrer, L., Kajarekar, S., Scheffer, N., Stolcke, A., Akbacak, M. (2008) Detecting nonnative speech using speaker recognition approaches. Proc. The Speaker and Language Recognition Workshop (Odyssey 2008), paper 26

  author={Elizabeth Shriberg and Luciana Ferrer and Sachin Kajarekar and Nicolas Scheffer and Andreas Stolcke and Murat Akbacak},
  title={{Detecting nonnative speech using speaker recognition approaches}},
  booktitle={Proc. The Speaker and Language Recognition Workshop (Odyssey 2008)},
  pages={paper 26}