INTERSPEECH 2013
14thAnnual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

Person Name Spotting by Combining Acoustic Matching and LDA Topic Models

Grégory Senay, Benjamin Bigot, Richard Dufour, Georges Linarès, Corinne Fredouille

LIA, France

In this article, we are interested in spoken term detection task, with a particular focus on Person Name (PN) spotting in automatic speech recognition (ASR) system outputs. We propose a two-step method that combines an acoustic matching based on a Phoneme Confusion Network (PCN) with a semantic rescoring based on the Latent Dirichlet Allocation (LDA) models. The first module allows to find, in the PCN, potential PN candidates in speech segments, while the second is in charge of ranking the competing PN, according to a LDA topic model. The proposed LDA-based approach outperforms significantly the baseline system based on a search in the ASR phoneme lattice, obtaining a F-measure score of 77.04% on PN detection.

Full Paper

Bibliographic reference.  Senay, Grégory / Bigot, Benjamin / Dufour, Richard / Linarès, Georges / Fredouille, Corinne (2013): "Person name spotting by combining acoustic matching and LDA topic models", In INTERSPEECH-2013, 1584-1588.