Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

A NOISE ROBUST MULTILINGUAL REFERENCE RECOGNISER BASED ON SPEECHDAT(II)

Børge Lindberg (1), Finn Tore Johansen (2), Narada Warakagoda (2), Gunnar Lehtinen (3), Zdravko Kacic (4), Andrej Zgank (4), Kjell Elenius (5), Giampiero Salvi (5)

(1) Center for PersonKommunikation (CPK), Aalborg, Denmark
(2) Telenor R&D, Kjeller, Norway
(3) Swiss Federal Institute of Technology (ETH), Zurich, Switzerland
(4) University of Maribor, Slovenia
(5) Kungliga Tekniska Högskolan (KTH), Stockholm, Sweden

An important aspect of noise robustness of automatic speech recognisers (ASR) is the proper handling of non-speech acoustic events. The present paper describes further improvements of an already existing reference recogniser towards achieving such kind of robustness. The reference recogniser applied is the COST 249 SpeechDat reference recogniser, which is a fully automatic, language-independent training procedure for building a phonetic recogniser (http://www.telenor.no/fou/prosjekter/taletek/refrec). The reference recogniser relies on the HTK toolkit and a SpeechDat(II) compatible database, and is designed to serve as a reference system in multilingual speech recognition research. The paper describes version 0.96 of the reference recogniser which take into account labelled non-speech acoustic events during training and provides robustness against these during testing. Results are presented on small and medium vocabulary recognition for six languages.


Full Paper

Bibliographic reference.  Lindberg, Børge / Johansen, Finn Tore / Warakagoda, Narada / Lehtinen, Gunnar / Kacic, Zdravko / Zgank, Andrej / Elenius, Kjell / Salvi, Giampiero (2000): "A NOISE ROBUST MULTILINGUAL REFERENCE RECOGNISER BASED ON SPEECHDAT(II)", In ICSLP-2000, vol.3, 370-373.