Interspeech'2005 - Eurospeech

Lisbon, Portugal
September 4-8, 2005

Lexical Out-of-Vocabulary Models for One-Stage Speech Interpretation

Matthias Thomae, Tibor Fabian, Robert Lieb, Günther Ruske

Technische Universität München, Germany

We present an approach to explicit, statistical, lexical-level out-ofvocabulary (OOV) word modeling for direct integration into the search space of a one-stage speech interpretation system. For this purpose, a generic pronunciation model for unknown words is derived from large pronunciation lexica and, optionally, word frequency knowledge. Known statistical language modeling (LM) methods are utilized to estimate different phoneme LM and apply different smoothing techniques. The resulting OOV word models are integrated with the hierarchical language model of our uniform modeling framework by declaring semantically irrelevant parts of the training utterances as unknown. Experiments were conducted with two different OOV training lexica on an airport information dialogue application, evaluating the results with both in-vocabulary (IV) and OOV-related metrics. Results for various OOV model configurations are presented, showing that OOV detection rates of 60-70% can be achieved with 1-2% falsely accepted IV words, simultaneously improving accuracy on the semantic representation.

Full Paper

Bibliographic reference.  Thomae, Matthias / Fabian, Tibor / Lieb, Robert / Ruske, Günther (2005): "Lexical out-of-vocabulary models for one-stage speech interpretation", In INTERSPEECH-2005, 441-444.