ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Robust keyword spotting with rapidly adapting point process models

Aren Jansen, Partha Niyogi

In this paper, we investigate the noise robustness properties of frame-based and sparse point process-based models for spotting keywords in continuous speech. We introduce a new strategy to improve point process model (PPM) robustness by adapting low-level feature detector thresholds to preserve background firing rates in the presence of noise. We find that this unsupervised approach can significantly outperform fully supervised maximum likelihood linear regression (MLLR) adaptation of an equivalent keyword-filler HMM system in the presence of additive white and pink noise. Moreover, we find that the sparsity of PPMs introduces an inherent resilience to non-stationary babble noise not exhibited by the frame-based HMM system. Finally, we demonstrate that our approach requires less adaptation data than MLLR, permitting rapid online adaptation.


doi: 10.21437/Interspeech.2009-707

Cite as: Jansen, A., Niyogi, P. (2009) Robust keyword spotting with rapidly adapting point process models. Proc. Interspeech 2009, 2767-2770, doi: 10.21437/Interspeech.2009-707

@inproceedings{jansen09_interspeech,
  author={Aren Jansen and Partha Niyogi},
  title={{Robust keyword spotting with rapidly adapting point process models}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2767--2770},
  doi={10.21437/Interspeech.2009-707}
}