ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Soft decisions in missing data techniques for robust automatic speech recognition

Jon Barker, Ljubomir Josifovski, Martin Cooke, Phil Green

In previous work we have developed the theory and demonstrated the promise of the Missing Data approach to robust Automatic Speech Recognition. This technique is based on hard decisions as to whether each time-frequency "pixel" is either reliable or unreliable. In this paper we replace these discrete decisions with soft estimates of the probability that each "pixel" is reliable. We adapt the probability calculation to use these estimates as weighting factors for the complementary reliable/unreliable interpretations for each feature vector component. Experiments using the TIDigits connected digit recognition task demonstrate that this technique a ords significant performance improvements at low SNRs.


Cite as: Barker, J., Josifovski, L., Cooke, M., Green, P. (2000) Soft decisions in missing data techniques for robust automatic speech recognition. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 373-376

@inproceedings{barker00_icslp,
  author={Jon Barker and Ljubomir Josifovski and Martin Cooke and Phil Green},
  title={{Soft decisions in missing data techniques for robust automatic speech recognition}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 373-376}
}