ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Discriminative n-gram selection for dialect recognition

F. S. Richardson, W. M. Campbell, P. A. Torres-Carrasquillo

Dialect recognition is a challenging and multifaceted problem. Distinguishing between dialects can rely upon many tiers of interpretation of speech data — e.g., prosodic, phonetic, spectral, and word. High-accuracy automatic methods for dialect recognition typically use either phonetic or spectral characteristics of the input. A challenge with spectral system, such as those based on shifted-delta cepstral coefficients, is that they achieve good performance but do not provide insight into distinctive dialect features. In this work, a novel method based upon discriminative training and phone N-grams is proposed. This approach achieves excellent classification performance, fuses well with other systems, and has interpretable dialect characteristics in the phonetic tier. The method is demonstrated on data from the LDC and prior NIST language recognition evaluations. The method is also combined with spectral methods to demonstrate state-of-the-art performance in dialect recognition.


doi: 10.21437/Interspeech.2009-73

Cite as: Richardson, F.S., Campbell, W.M., Torres-Carrasquillo, P.A. (2009) Discriminative n-gram selection for dialect recognition. Proc. Interspeech 2009, 192-195, doi: 10.21437/Interspeech.2009-73

@inproceedings{richardson09_interspeech,
  author={F. S. Richardson and W. M. Campbell and P. A. Torres-Carrasquillo},
  title={{Discriminative n-gram selection for dialect recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={192--195},
  doi={10.21437/Interspeech.2009-73}
}