EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

FST-Based Recognition Techniques for Multi-Lingual and Multi-Domain Spontaneous Speech

Timothy J. Hazen, I. Lee Hetherington, Alex Park

MIT Laboratory for Computer Science, USA

In this paper we present techniques for building multi-domain and multilingual recognizers within a finite-state transducer (FST) framework. The flexibility of the FST approach is also demonstrated on the task of incorporating networks modeling different types of non-speech events into an existing word lattice network. The ability to create robust multi-domain and/or multi-lingual recognizers for spontaneous speech will enable a conversational system to switch seamlessly and automatically among different domains and/or languages. Preliminary results using a bi-domain recognizer exhibit only small recognition accuracy degradation in comparison to domain-dependent recognition. Similarly promising results were observed using a bi-lingual recognizer which performs simultaneous language identification and recognition. When using the FST techniques to add non-speech models to the recognizer, experiments show a 10% reduction in word error rate across all utterances and a 30% reduction on utterances containing non-speech events.

Full Paper

Bibliographic reference.  Hazen, Timothy J. / Hetherington, I. Lee / Park, Alex (2001): "FST-based recognition techniques for multi-lingual and multi-domain spontaneous speech", In EUROSPEECH-2001, 1591-1594.