ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Person name spotting by combining acoustic matching and LDA topic models

Grégory Senay, Benjamin Bigot, Richard Dufour, Georges Linarès, Corinne Fredouille

In this article, we are interested in spoken term detection task, with a particular focus on Person Name (PN) spotting in automatic speech recognition (ASR) system outputs. We propose a two-step method that combines an acoustic matching based on a Phoneme Confusion Network (PCN) with a semantic rescoring based on the Latent Dirichlet Allocation (LDA) models. The first module allows to find, in the PCN, potential PN candidates in speech segments, while the second is in charge of ranking the competing PN, according to a LDA topic model. The proposed LDA-based approach outperforms significantly the baseline system based on a search in the ASR phoneme lattice, obtaining a F-measure score of 77.04% on PN detection.


doi: 10.21437/Interspeech.2013-399

Cite as: Senay, G., Bigot, B., Dufour, R., Linarès, G., Fredouille, C. (2013) Person name spotting by combining acoustic matching and LDA topic models. Proc. Interspeech 2013, 1584-1588, doi: 10.21437/Interspeech.2013-399

@inproceedings{senay13_interspeech,
  author={Grégory Senay and Benjamin Bigot and Richard Dufour and Georges Linarès and Corinne Fredouille},
  title={{Person name spotting by combining acoustic matching and LDA topic models}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={1584--1588},
  doi={10.21437/Interspeech.2013-399}
}