ISCA Archive SLTU 2012

Automatic speech recognition system for under-resourced languages based on Speeral: application to berber language

Z. Benkhellat, E. Ferreira, Pascal Nocera, M. Guerti

The ability to collect and process a large amount of resources (vocabularies, text corpora, transcribed speech corpora, phonetic dictionaries) is a critical prerequisite for systems based on statistical methods. This problem becomes crucial for languages that lack computer resources, known as under-resourced languages, such as African languages. Our work aims to find an efficient methodology for improving speech recognition systems for such languages. This article presents a possible solution for the Berber language and describes the set of tools used in our study. Specifically, we addressed the shortage of resources by taking into account the linguistic specificities of the Berber language and by using innovative methods to build the ASR resources (acoustic model, lexicon and language model).

Index Terms: speech recognition, Berber language, Speeral, under-resourced language


Cite as: Benkhellat, Z., Ferreira, E., Nocera, P., Guerti, M. (2012) Automatic speech recognition system for under-resourced languages based on Speeral: application to berber language. Proc. 3rd Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2012), 152-155

@inproceedings{benkhellat12_sltu,
  author={Z. Benkhellat and E. Ferreira and Pascal Nocera and M. Guerti},
  title={{Automatic speech recognition system for under-resourced languages based on Speeral: application to berber language}},
  year=2012,
  booktitle={Proc. 3rd Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2012)},
  pages={152--155}
}