INTERSPEECH 2010
11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Augmented Set of Features for Confidence Estimation in Spoken Term Detection

Javier Tejedor (1), Doroteo T. Toledano (1), Miguel Bautista (1), Simon King (2), Dong Wang (2), José Colás (1)

(1) Universidad Autónoma de Madrid, Spain
(2) University of Edinburgh, UK

Discriminative confidence estimation along with confidence normalisation have been shown to construct robust decision maker modules in spoken term detection (STD) systems. Discriminative confidence estimation, making use of termdependent features, has been shown to improve the widely used lattice-based confidence estimation in STD. In this work, we augment the set of these term-dependent features and show a significant improvement in the STD performance both in terms of ATWV and DET curves in experiments conducted on a Spanish geographical corpus. This work also proposes a multiple lineal regression analysis to carry out the feature selection. Next, the most informative features derived from it are used within the discriminative confidence on the STD system.

Full Paper

Bibliographic reference.  Tejedor, Javier / Toledano, Doroteo T. / Bautista, Miguel / King, Simon / Wang, Dong / Colás, José (2010): "Augmented set of features for confidence estimation in spoken term detection", In INTERSPEECH-2010, 701-704.