10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Stochastic Pronunciation Modelling for Spoken Term Detection

Dong Wang, Simon King, Joe Frankel

University of Edinburgh, UK

A major challenge faced by a spoken term detection (STD) system is the detection of out-of-vocabulary (OOV) terms. Although a subword-based STD system is able to detect OOV terms, performance reduction is always observed compared to in-vocabulary terms. Current approaches to STD do not acknowledge the particular properties of OOV terms, such as pronunciation uncertainty. In this paper, we use a stochastic pronunciation model to deal with the uncertain pronunciations of OOV terms. By considering all possible term pronunciations, predicted by a joint-multigram model, we observe a significant performance improvement.

Full Paper

Bibliographic reference.  Wang, Dong / King, Simon / Frankel, Joe (2009): "Stochastic pronunciation modelling for spoken term detection", In INTERSPEECH-2009, 2135-2138.