Sixth International Conference on Spoken Language Processing
An important aspect of noise robustness of automatic speech recognisers (ASR) is the proper handling of non-speech acoustic events. The present paper describes further improvements of an already existing reference recogniser towards achieving such kind of robustness. The reference recogniser applied is the COST 249 SpeechDat reference recogniser, which is a fully automatic, language-independent training procedure for building a phonetic recogniser (http://www.telenor.no/fou/prosjekter/taletek/refrec). The reference recogniser relies on the HTK toolkit and a SpeechDat(II) compatible database, and is designed to serve as a reference system in multilingual speech recognition research. The paper describes version 0.96 of the reference recogniser which take into account labelled non-speech acoustic events during training and provides robustness against these during testing. Results are presented on small and medium vocabulary recognition for six languages.
Bibliographic reference. Lindberg, Børge / Johansen, Finn Tore / Warakagoda, Narada / Lehtinen, Gunnar / Kacic, Zdravko / Zgank, Andrej / Elenius, Kjell / Salvi, Giampiero (2000): "A NOISE ROBUST MULTILINGUAL REFERENCE RECOGNISER BASED ON SPEECHDAT(II)", In ICSLP-2000, vol.3, 370-373.