ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Large-scale Polish SLU

Patrick Lehnen, Stefan Hahn, Hermann Ney, Agnieszka Mykowiecka

In this paper, we present state-of-the art concept tagging results on a new corpus for Polish SLU. For this language, it is the first large-scale corpus (¡«200 different concepts) which has been semantically annotated and will be made publicly available. Conditional Random Fields have proven to lead to best results for string-to-string translation problems. Using this approach, we achieve a concept error rate of 22.6% on an evaluation corpus. To additionally extract attribute values, a combination of a statistical and a rule-based approach is used leading to a CER of 30.2%.


doi: 10.21437/Interspeech.2009-696

Cite as: Lehnen, P., Hahn, S., Ney, H., Mykowiecka, A. (2009) Large-scale Polish SLU. Proc. Interspeech 2009, 2723-2726, doi: 10.21437/Interspeech.2009-696

@inproceedings{lehnen09_interspeech,
  author={Patrick Lehnen and Stefan Hahn and Hermann Ney and Agnieszka Mykowiecka},
  title={{Large-scale Polish SLU}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2723--2726},
  doi={10.21437/Interspeech.2009-696}
}