EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Context-dependent Probabilistic Hierarchical Sublexical Modelling Using Finite State Transducers

Xiaolong Mou, Stephanie Seneff, Victor Zue

MIT Laboratory for Computer Science, USA

This paper describes a unified architecture for integrating sub-lexical models with speech recognition, and a layered framework for contextdependent probabilistic hierarchical sub-lexical modelling using finite state transducers. Our major motivation for designing a unified architecture is to provide a framework such that probabilistic sublexical linguistic components can be integrated with other speech recognition components without sacrificing the flexibilities of their independent developments and configurations. We are also able to obtain a tightly coupled interface between recognizers and sub-lexical components. We present a view of using layered probabilistic models to augment contextfree grammars (CFGs). It captures context-dependent probabilistic information beyond the standard CFG formalism, and provides the flexibility of developing suitable probabilistic models independently for each sub-lexical layer. Experimental results show that such an approach can achieve comparable performance to pronunciation network approaches on in-vocabulary utterances, while being able to substantially reduce errors on utterances with previously unseen words.

Full Paper

Bibliographic reference.  Mou, Xiaolong / Seneff, Stephanie / Zue, Victor (2001): "Context-dependent probabilistic hierarchical sublexical modelling using finite state transducers", In EUROSPEECH-2001, 451-454.