A speech recognition system for the Polish language is described. The presentation focuses on the adaptation of the Kaldi toolkit to Polish, our own grapheme-to-phoneme conversion tool, and a corpus of Polish speech that we collected. Approaches to commercial applications are also described.
Bibliographic reference. Ziółko, Bartosz / Jadczyk, Tomasz / Skurzok, Dawid / Żelasko, Piotr / Gałka, Jakub / Pędzimąż, Tomasz / Gawlik, Ireneusz / Pałka, Szymon (2015): "SARMATA 2.0 automatic Polish language speech recognition system", In INTERSPEECH-2015, 1062-1063.