EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

FLaVoR: A Flexible Architecture for LVCSR

Kris Demuynck, Tom Laureys, Dirk van Compernolle, Hugo van Hamme

Katholieke Universiteit Leuven, Belgium

This paper describes a new architecture for large vocabulary continuous speech recognition (LVCSR), which will be developed within the project FLaVoR (Flexible Large Vocabulary Recognition). The proposed architecture abandons the standard all-in-one search strategy with integrated acoustic, lexical and language model information. Instead, a modular framework is proposed which allows for the integration of more complex linguistic components. The search process consists of two layers. First, a pure acoustic-phonemic search generates a dense phoneme network enriched with meta-data. Then, the output of the first layer is used by sophisticated language technology components for word decoding in the second layer. Preliminary experiments prove the feasibility of the approach.

Full Paper

Bibliographic reference.  Demuynck, Kris / Laureys, Tom / Compernolle, Dirk van / Hamme, Hugo van (2003): "FLavor: a flexible architecture for LVCSR", In EUROSPEECH-2003, 1973-1976.