ISCA Archive Interspeech 2011

Analysis of dialectal influence in pan-Arabic ASR

Udhyakumar Nallasamy, Michael Garbus, Florian Metze, Qin Jin, Thomas Schaaf, Tanja Schultz

In this paper, we analyze the impact of five Arabic dialects on the front-end and pronunciation dictionary components of an Automatic Speech Recognition (ASR) system. We use the ASR system's phonetic decision tree as a diagnostic tool to compare the robustness of MFCC and MLP front-ends to dialectal variations in the speech data, and find that MLP Bottle-Neck features are less robust to such variations. We also perform a rule-based analysis of the pronunciation dictionary, which enables us to identify dialectal words in the vocabulary and automatically generate pronunciations for unseen words. We show that our technique produces pronunciations with an average phone error rate of 9.2%.
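
For readers unfamiliar with the metric, phone error rate is commonly computed as the edit distance between generated and reference phone sequences, divided by the number of reference phones. The sketch below illustrates that common definition only; the abstract does not say how the paper aggregates errors (pooled over all phones or averaged per word), so the pooled form here is an assumption. The function names and the phone strings in the usage note are hypothetical and are not taken from the paper.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two phone sequences (lists of strings)."""
    # prev[j] holds the distance between the reference prefix seen so far
    # and the first j phones of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def phone_error_rate(pairs):
    """Pooled phone error rate over (reference, hypothesis) pairs.

    Assumption: errors are summed over all words and divided by the total
    number of reference phones; the paper may aggregate differently.
    """
    pairs = list(pairs)
    errors = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    total = sum(len(ref) for ref, _ in pairs)
    if total == 0:
        raise ValueError("reference contains no phones")
    return errors / total
```

As a usage example with made-up data, `phone_error_rate([(["k", "i", "t", "a", "b"], ["k", "i", "t", "a", "b"]), (["q", "a", "l", "b"], ["g", "a", "l", "b"])])` counts one substitution against nine reference phones.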

doi: 10.21437/Interspeech.2011-191

Cite as: Nallasamy, U., Garbus, M., Metze, F., Jin, Q., Schaaf, T., Schultz, T. (2011) Analysis of dialectal influence in pan-Arabic ASR. Proc. Interspeech 2011, 1721-1724, doi: 10.21437/Interspeech.2011-191

  author={Udhyakumar Nallasamy and Michael Garbus and Florian Metze and Qin Jin and Thomas Schaaf and Tanja Schultz},
  title={{Analysis of dialectal influence in pan-Arabic ASR}},
  booktitle={Proc. Interspeech 2011},