ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Continuous speech recognition using attention shift decoding with soft decision

Ozlem Kalinli, Shrikanth S. Narayanan

We present an attention shift decoding (ASD) method inspired by human speech recognition. In contrast to the traditional automatic speech recognition (ASR) systems, ASD decodes speech inconsecutively using reliability criteria; the gaps (unreliable speech regions) are decoded with the evidence of islands (reliable speech regions). On the BU Radio News Corpus, ASD provides significant improvement (2.9% absolute) over the baseline ASR results when it is used with oracle island-gap information. At the core of the ASD method is the automatic island-gap detection. Here, we propose a new feature set for automatic island-gap detection which achieves 83.7% accuracy. To cope with the imperfect nature of the island-gap classification, we also propose a new ASD algorithm using soft decision. The ASD with soft decision provides 0.4% absolute (2.2% relative) improvement over the baseline ASR results when it is used with automatically detected islands and gaps.


doi: 10.21437/Interspeech.2009-557

Cite as: Kalinli, O., Narayanan, S.S. (2009) Continuous speech recognition using attention shift decoding with soft decision. Proc. Interspeech 2009, 1927-1930, doi: 10.21437/Interspeech.2009-557

@inproceedings{kalinli09_interspeech,
  author={Ozlem Kalinli and Shrikanth S. Narayanan},
  title={{Continuous speech recognition using attention shift decoding with soft decision}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={1927--1930},
  doi={10.21437/Interspeech.2009-557}
}