ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Integrating multiple knowledge sources for word hypotheses graph interpretation

Volker Warnke, Florian Gallwitz, Anton Batliner, Jan Buckow, R. Huber, Elmar Nöth, A. Höthker

We present an integrated approach for the interpretation of word hypotheses graphs (WHGs) using multiple knowledge sources. Commonly, different knowledge sources in speech understanding are applied sequentially. Typically, speech understanding systems, such as the {\vm} speech-to-speech translation system, first use a word recognizer to determine word hypotheses, only based on acoustic and language model (LM) information. The resulting word sequences or WHGs are then segmented according to syntactic and/or prosodic information. Finally, these segments are interpreted by a parser or a stochastic process. Thus, it is impossible to use the knowledge of the syntactic-prosodic process, the parser or any other subsequent process to find the best word sequence. In our new approach we use acoustic, prosodic and LM information to determine the best word chain, to detect syntactic/prosodic/pragmatic phrase boundaries and to classify dialog acts in one integrated search procedure, based on a WHG or a word lattice.


doi: 10.21437/Eurospeech.1999-62

Cite as: Warnke, V., Gallwitz, F., Batliner, A., Buckow, J., Huber, R., Nöth, E., Höthker, A. (1999) Integrating multiple knowledge sources for word hypotheses graph interpretation. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 235-238, doi: 10.21437/Eurospeech.1999-62

@inproceedings{warnke99_eurospeech,
  author={Volker Warnke and Florian Gallwitz and Anton Batliner and Jan Buckow and R. Huber and Elmar Nöth and A. Höthker},
  title={{Integrating multiple knowledge sources for word hypotheses graph interpretation}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={235--238},
  doi={10.21437/Eurospeech.1999-62}
}