ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Stochastic pronunciation modelling for spoken term detection

Dong Wang, Simon King, Joe Frankel

A major challenge faced by a spoken term detection (STD) system is the detection of out-of-vocabulary (OOV) terms. Although a subword-based STD system is able to detect OOV terms, performance reduction is always observed compared to in-vocabulary terms. Current approaches to STD do not acknowledge the particular properties of OOV terms, such as pronunciation uncertainty. In this paper, we use a stochastic pronunciation model to deal with the uncertain pronunciations of OOV terms. By considering all possible term pronunciations, predicted by a joint-multigram model, we observe a significant performance improvement.

doi: 10.21437/Interspeech.2009-610

Cite as: Wang, D., King, S., Frankel, J. (2009) Stochastic pronunciation modelling for spoken term detection. Proc. Interspeech 2009, 2135-2138, doi: 10.21437/Interspeech.2009-610

  author={Dong Wang and Simon King and Joe Frankel},
  title={{Stochastic pronunciation modelling for spoken term detection}},
  booktitle={Proc. Interspeech 2009},