ISCA Archive Eurospeech 1999

Robust information extraction from spoken language data

David D. Palmer, Mari Ostendorf, John D. Burger

In this paper we address the problem of information extraction from speech data, particularly improving robustness to automatic recognition errors. We describe a baseline probabilistic model that uses word-class smoothing in a phrase n-gram language model. The model is adjusted to the error characteristics of a speech recognizer by inserting error tokens in the training data and by using word confidences in decoding to account for possible errors in the recognition output. Experiments show improved performance when training and test conditions are matched.


doi: 10.21437/Eurospeech.1999-168

Cite as: Palmer, D.D., Ostendorf, M., Burger, J.D. (1999) Robust information extraction from spoken language data. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1035-1038, doi: 10.21437/Eurospeech.1999-168

@inproceedings{palmery99_eurospeech,
  author={David D. Palmer and Mari Ostendorf and John D. Burger},
  title={{Robust information extraction from spoken language data}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={1035--1038},
  doi={10.21437/Eurospeech.1999-168}
}