A speech recognition system for the Polish language is described. The presentation will focus on an adjustment of the Kaldi toolkit for Polish, our own grapheme to phoneme conversion tool and a corpus of Polish we collected. The approaches to commercial applications will also be described.
Cite as: Ziółko, B., Jadczyk, T., Skurzok, D., Żelasko, P., Gałka, J., Pȩdzimąż, T., Gawlik, I., Pałka, S. (2015) SARMATA 2.0 automatic Polish language speech recognition system. Proc. Interspeech 2015, 1062-1063
@inproceedings{zioko15_interspeech, author={Bartosz Ziółko and Tomasz Jadczyk and Dawid Skurzok and Piotr Żelasko and Jakub Gałka and Tomasz Pȩdzimąż and Ireneusz Gawlik and Szymon Pałka}, title={{SARMATA 2.0 automatic Polish language speech recognition system}}, year=2015, booktitle={Proc. Interspeech 2015}, pages={1062--1063} }