September 22-25, 1997
A major challenge in speech recognition based on acoustic subword units is creating a lexicon which is robust to inter- and intra-speaker variations. In this paper we present two different approaches for incorporating simple word-level linguistic knowledge into the labelling step of the training procedure. The proposed systems also utilise a scheme for combined optimisation of baseforms and subword models. For the TI46 database, these methods are shown to greatly improve the performance compared to an acoustic subword based speech recogniser employing unsupervised labelling, and they are found to perform as well as systems utilising whole-word models and context independent phoneme models.
Bibliographic reference. Holter, Trym / Svendsen, Torbjorn (1997): "Incorporating linguistic knowledge and automatic baseform generation in acoustic subword unit based speech recognition", In EUROSPEECH-1997, 1159-1162.