10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Lexical and Phonetic Modeling for Arabic Automatic Speech Recognition

Long Nguyen (1), Tim Ng (1), Kham Nguyen (2), Rabih Zbib (3), John Makhoul (1)

(1) BBN Technologies, USA
(2) Northeastern University, USA
(3) MIT, USA

In this paper, we describe the use of either words or morphemes as lexical modeling units and the use of either graphemes or phonemes as phonetic modeling units for Arabic automatic speech recognition (ASR). We designed four Arabic ASR systems: two wordbased systems and two morpheme-based systems. Experimental results using these four systems show that they have comparable state-of-the-art performance individually, but the more sophisticated morpheme-based system tends to be the best. However, they seem to complement each other quite well within the ROVER system combination framework to produce substantially-improved combined results.

Full Paper

Bibliographic reference.  Nguyen, Long / Ng, Tim / Nguyen, Kham / Zbib, Rabih / Makhoul, John (2009): "Lexical and phonetic modeling for Arabic automatic speech recognition", In INTERSPEECH-2009, 712-715.