ITRW on Non-Linear Speech Processing
(NOLISP 07)

Paris, France
May 22-25, 2007

Discriminative Keyword Spotting

Joseph Keshet (1), David Grangier (2), Samy Bengio (2)

(1) School of Computer Science & Engineering, The Hebrew University, Jerusalem, Israel
(2) IDIAP Research Institute, Martigny, Switzerland

This paper proposes a new approach to keyword spotting that is not based on hidden Markov models (HMMs). The proposed method employs a new discriminative learning procedure in which the learning phase aims at maximizing the area under the ROC curve, since this quantity is the most common measure for evaluating keyword spotters. The keyword spotter we devise is based on nonlinearly mapping the input acoustic representation of the speech utterance, along with the target keyword, into an abstract vector space. Building on techniques used in large margin methods for predicting whole sequences, our keyword spotter reduces to a classifier in the abstract vector space that separates speech utterances in which the keyword was uttered from speech utterances in which it was not. We describe a simple iterative algorithm for learning the keyword spotter and discuss its formal properties. Experiments with the TIMIT corpus show that our method outperforms the conventional HMM-based approach.
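
To make the training objective concrete, the following minimal sketch computes the area under the ROC curve in its pairwise form: the fraction of (keyword-present, keyword-absent) utterance pairs in which the keyword-present utterance receives the higher score. This equivalence with the Wilcoxon-Mann-Whitney statistic is standard; the function name `pairwise_auc` and the example scores are hypothetical and are not taken from the paper, whose learning algorithm is not reproduced here.

```python
from itertools import product


def pairwise_auc(pos_scores, neg_scores):
    """Area under the ROC curve, computed as the fraction of
    (positive, negative) score pairs that are ranked correctly.
    Ties count as half a correct pair."""
    if not pos_scores or not neg_scores:
        raise ValueError("need at least one score in each group")
    correct = 0.0
    for p, n in product(pos_scores, neg_scores):
        if p > n:
            correct += 1.0
        elif p == n:
            correct += 0.5
    return correct / (len(pos_scores) * len(neg_scores))


# Hypothetical confidence scores from some keyword spotter:
# utterances that contain the keyword ...
pos = [0.9, 0.7, 0.4]
# ... and utterances that do not.
neg = [0.6, 0.3, 0.1]

auc = pairwise_auc(pos, neg)
print(f"AUC = {auc:.3f}")
```

Viewed this way, maximizing the area under the ROC curve amounts to ranking every keyword-present utterance above every keyword-absent one, which is why a classifier that separates the two groups of utterances fits this objective.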


Bibliographic reference. Keshet, Joseph / Grangier, David / Bengio, Samy (2007): "Discriminative keyword spotting", in NOLISP-2007, 47-50.