ISCA Archive Interspeech 2008
ISCA Archive Interspeech 2008

Dialect classification via discriminative training

Yun Lei, John H. L. Hansen

Variability in speech due to dialect is a major factor limiting speech system performance for speech recognition, spoken document retrieval, and dialog systems. In this study, we propose a novel discriminative algorithm to improve dialect classification for unsupervised spontaneous speech in Arabic. No transcripts are used for either training or testing, and all data are spontaneous speech. The Gaussian mixture model (GMM) is used as our baseline system for dialect classification. The major motivation is to remove confused/distractive regions from the dialect acoustic space, while emphasizing discriminative/sensitive information. The Kullback-Leibler divergence is used to find the most discriminative GMM mixtures (KLD-GMM), after which the confused acoustic GMM region is removed. The proposed algorithm is evaluated on three dialects of Arabic, with measurable improvement achieved (4.0%), over a generalized maximum likelihood estimation GMM baseline (MLE-GMM) system.

doi: 10.21437/Interspeech.2008-229

Cite as: Lei, Y., Hansen, J.H.L. (2008) Dialect classification via discriminative training. Proc. Interspeech 2008, 735-738, doi: 10.21437/Interspeech.2008-229

  author={Yun Lei and John H. L. Hansen},
  title={{Dialect classification via discriminative training}},
  booktitle={Proc. Interspeech 2008},