In this paper, we introduce a word graph interface between speech and natural language processing systems within a flexible speech understanding framework based on stochastic concept modeling augmented with background "filler" models. Each concept represents a set of phrases ( written as a context free grammar (CFG)) with the same meaning, and is compiled into a stochastic recursive transition network (SRTN). The arcs (or rules) are tagged with probabilities after training. The filler models are used for phrases that are not covered by the concept networks. The structure in concept+filler sequences is captured by n-grams. The interface is implemented within the context of CU Communicator spoken dialog system. We investigate the effect of several different filler models and interpolation of complementary language models on the system performance. We report notable performance improvements compared to the baseline system. The gain in performance along with the efficiency and flexibility of the method motivates future work on the implementation of a tighter interface.
Cite as: Hacioglu, K., Ward, W. (2001) A word graph interface for a flexible concept based speech understanding framework. Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 1775-1778, doi: 10.21437/Eurospeech.2001-419
@inproceedings{hacioglu01_eurospeech, author={Kadri Hacioglu and Wayne Ward}, title={{A word graph interface for a flexible concept based speech understanding framework}}, year=2001, booktitle={Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)}, pages={1775--1778}, doi={10.21437/Eurospeech.2001-419} }