ISCA Archive ICSLP 2000

A noise robust multilingual reference recogniser based on SPEECHDAT(II)

Børge Lindberg, Finn Tore Johansen, Narada Warakagoda, Gunnar Lehtinen, Zdravko Kacic, Andrej Zgank, Kjell Elenius, Giampiero Salvi

An important aspect of noise robustness in automatic speech recognisers (ASR) is the proper handling of non-speech acoustic events. This paper describes further improvements to an existing reference recogniser aimed at achieving this kind of robustness. The reference recogniser used is the COST 249 SpeechDat reference recogniser, a fully automatic, language-independent training procedure for building a phonetic recogniser (http://www.telenor.no/fou/prosjekter/taletek/refrec). It relies on the HTK toolkit and a SpeechDat(II) compatible database, and is designed to serve as a reference system in multilingual speech recognition research. The paper describes version 0.96 of the reference recogniser, which takes labelled non-speech acoustic events into account during training and provides robustness against them during testing. Results are presented for small and medium vocabulary recognition in six languages.


doi: 10.21437/ICSLP.2000-553

Cite as: Lindberg, B., Johansen, F.T., Warakagoda, N., Lehtinen, G., Kacic, Z., Zgank, A., Elenius, K., Salvi, G. (2000) A noise robust multilingual reference recogniser based on SPEECHDAT(II). Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 370-373, doi: 10.21437/ICSLP.2000-553

@inproceedings{lindberg00b_icslp,
  author={Børge Lindberg and Finn Tore Johansen and Narada Warakagoda and Gunnar Lehtinen and Zdravko Kacic and Andrej Zgank and Kjell Elenius and Giampiero Salvi},
  title={{A noise robust multilingual reference recogniser based on SPEECHDAT(II)}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={3},
  pages={370--373},
  doi={10.21437/ICSLP.2000-553}
}