14th Annual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Diacritics Restoration for Arabic Dialect Texts

S. Harrat (1), M. Abbas (2), K. Meftouh (3), K. Smaili (4)

(1) ENS Bouzareah, Algeria
(2) CRSTDLA, Algeria
(3) Annaba University, Algeria
(4) LORIA, France

In this paper, we present a statistical approach to the automatic diacritization of Algiers dialectal texts. The approach is based on statistical machine translation. We first investigate it on Modern Standard Arabic (MSA) texts using several data sources, then extrapolate the results to the available dialectal texts. For evaluation, we use word and diacritization error rates as well as precision and recall.
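
The abstract names word and diacritization error rates without defining them. As a rough illustration only, the sketch below assumes one common convention: the diacritization error rate is the share of letters whose attached diacritics differ from the reference, and the word error rate is the share of words containing at least one such letter. The function names `split_marks` and `error_rates` are hypothetical, and the paper's own definitions may differ.

```python
# Arabic diacritic marks (tanween, short vowels, shadda, sukun): U+064B..U+0652
DIACRITICS = {chr(code) for code in range(0x064B, 0x0653)}


def split_marks(word):
    """Return a list of (letter, attached diacritics) pairs for one word."""
    pairs = []
    for ch in word:
        if ch in DIACRITICS and pairs:
            letter, marks = pairs[-1]
            pairs[-1] = (letter, marks + ch)
        else:
            pairs.append((ch, ""))
    return pairs


def error_rates(reference, hypothesis):
    """Return (word error rate, diacritization error rate).

    Assumption: both strings contain the same words and letters and
    differ only in their diacritics.
    """
    ref_words, hyp_words = reference.split(), hypothesis.split()
    if not ref_words or len(ref_words) != len(hyp_words):
        raise ValueError("texts must be non-empty and have equal word counts")

    letters = letter_errors = word_errors = 0
    for ref_word, hyp_word in zip(ref_words, hyp_words):
        ref_pairs, hyp_pairs = split_marks(ref_word), split_marks(hyp_word)
        if [l for l, _ in ref_pairs] != [l for l, _ in hyp_pairs]:
            raise ValueError("words differ in their base letters")
        # Compare marks as sorted sequences so that mark order is ignored.
        errors = sum(
            1
            for (_, ref_marks), (_, hyp_marks) in zip(ref_pairs, hyp_pairs)
            if sorted(ref_marks) != sorted(hyp_marks)
        )
        letters += len(ref_pairs)
        letter_errors += errors
        word_errors += 1 if errors else 0

    return word_errors / len(ref_words), letter_errors / letters
```

Under these assumed definitions, a two-word sentence of eight letters in which only the final case vowel is wrong has one erroneous word out of two and one erroneous letter out of eight.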

Bibliographic reference. Harrat, S. / Abbas, M. / Meftouh, K. / Smaili, K. (2013): "Diacritics restoration for Arabic dialect texts", in INTERSPEECH-2013, 1429-1433.