ISCA Archive Interspeech 2005

On designing and evaluating speech event detectors

Jinyu Li, Chin-Hui Lee

We study issues in designing speech event detectors for automatic speech recognition. Event detection is a critical component of a recently proposed automatic speech attribute transcription (ASAT) paradigm for speech research. As in keyword spotting and non-keyword rejection, a good detector must detect the speech attributes of interest while rejecting extraneous events. We compare frame-based and segment-based detectors, study their properties in detecting manners of articulation, and propose new performance measures. We test these detectors on the TIMIT database under several evaluation criteria. Our results indicate that segment-based detectors outperform frame-based detectors in several key aspects of speech detector design. We also show that performance can be significantly enhanced by incorporating discriminative training into the design of speech event detectors.

doi: 10.21437/Interspeech.2005-864

Cite as: Li, J., Lee, C.-H. (2005) On designing and evaluating speech event detectors. Proc. Interspeech 2005, 3365-3368, doi: 10.21437/Interspeech.2005-864

@inproceedings{li05h_interspeech,
  author={Jinyu Li and Chin-Hui Lee},
  title={{On designing and evaluating speech event detectors}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={3365--3368},
  doi={10.21437/Interspeech.2005-864}
}