ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Using prosodic information for disambiguation purposes

Roberto Gretter, Dino Seppi

In this work, we describe how prosodic information can be employed to improve the performance of an Automatic Speech Recognizer (ASR) for specific restricted tasks. The approach exploits additional prosodic information in a post-processing stage. Prosodic features are estimated at word level; this additional information is encoded through a feature extractor and is then modeled using a statistical classifier. To train and test this system we collected an Italian database designed to comprise specific dialogue problems like ambiguous utterances. The proposed system yields a 69.5% relative word error rate reduction compared to a traditional state-of-the-art recognizer for the task of recognizing sequences of numbers.


doi: 10.21437/Interspeech.2005-555

Cite as: Gretter, R., Seppi, D. (2005) Using prosodic information for disambiguation purposes. Proc. Interspeech 2005, 1821-1824, doi: 10.21437/Interspeech.2005-555

@inproceedings{gretter05_interspeech,
  author={Roberto Gretter and Dino Seppi},
  title={{Using prosodic information for disambiguation purposes}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={1821--1824},
  doi={10.21437/Interspeech.2005-555}
}