An important aspect of noise robustness of automatic speech recognisers (ASR) is the proper handling of non-speech acoustic events. The present paper describes further improvements of an already existing reference recogniser towards achieving such kind of robustness. The reference recogniser applied is the COST 249 SpeechDat reference recogniser, which is a fully automatic, language-independent training procedure for building a phonetic recogniser (http://www.telenor.no/fou/prosjekter/taletek/refrec). The reference recogniser relies on the HTK toolkit and a SpeechDat(II) compatible database, and is designed to serve as a reference system in multilingual speech recognition research. The paper describes version 0.96 of the reference recogniser which take into account labelled non-speech acoustic events during training and provides robustness against these during testing. Results are presented on small and medium vocabulary recognition for six languages.
Cite as: Lindberg, B., Johansen, F.T., Warakagoda, N., Lehtinen, G., Kacic, Z., Zgank, A., Elenius, K., Salvi, G. (2000) A noise robust multilingual reference recogniser based on SPEECHDAT(II). Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 370-373, doi: 10.21437/ICSLP.2000-553
@inproceedings{lindberg00b_icslp, author={Børge Lindberg and Finn Tore Johansen and Narada Warakagoda and Gunnar Lehtinen and Zdravko Kacic and Andrej Zgank and Kjell Elenius and Giampiero Salvi}, title={{A noise robust multilingual reference recogniser based on SPEECHDAT(II)}}, year=2000, booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)}, pages={vol. 3, 370-373}, doi={10.21437/ICSLP.2000-553} }