7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

A Comparison of HTK, ISIP and Julius in Slovenian Large Vocabulary Continuous Speech Recognition

Tomaz Rotovnik, Mirjam Sepesy Maucec, Bogomir Horvat, Zdravko Kacic

University of Maribor, Slovenia

In this paper recognition results from different speech decoders are presented for Slovenian large vocabulary speech recognition task. For speech recognition two different types of lexica and language models were used. Word based models were used for baseline system and sub-word (stems and endings) based models for comparison. For all decoders a two-pass decoding strategy was used. With all three decoders better recognition results were achieved using stemending models (3% absolute on average). Experiments also showed slightly better recognition results with Julius decoder, as opposed to two other decoders, and improvement of real-time factor for 65%. Introduce Segmeantal Inner Timewarping into


Full Paper

Bibliographic reference.  Rotovnik, Tomaz / Maucec, Mirjam Sepesy / Horvat, Bogomir / Kacic, Zdravko (2002): "A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition", In ICSLP-2002, 681-684.