ASR for South Slavic Languages Developed in Almost Automated Way

Jan Nouza, Radek Safarik, Petr Cerva


Slavic languages pose several specific challenges that need to be addressed in an ASR system design. Since we have already built an engine suited for highly-inflected languages, we focus on adopting it for new languages, now. In this case, we present an efficient way to adapt the system to all (seven) South Slavic languages, using methods and tools that benefit from language similarities, easily adjustable G2P rules or common phonetic subsets. We show that it is possible to build accurate language and acoustic models in an almost automated way, entirely from resources found on the web. The AMs are trained via cross-lingual bootstrapping followed by lightly supervised retraining from public data, like broadcast and parliament archives. Tests done on a set of main broadcast news in each language show WER values in range 16.8 to 21.5%, which includes also errors caused by OOL (out-of-language) utterances often occurring in this type of spoken programs.


DOI: 10.21437/Interspeech.2016-747

Cite as

Nouza, J., Safarik, R., Cerva, P. (2016) ASR for South Slavic Languages Developed in Almost Automated Way. Proc. Interspeech 2016, 3868-3872.

Bibtex
@inproceedings{Nouza+2016,
author={Jan Nouza and Radek Safarik and Petr Cerva},
title={ASR for South Slavic Languages Developed in Almost Automated Way},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-747},
url={http://dx.doi.org/10.21437/Interspeech.2016-747},
pages={3868--3872}
}