ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR

Penny Karanasou, Fran├žois Yvon, Thomas Lavergne, Lori Lamel

To enhance the recognition lexicon, it is important to be able to add pronunciation variants while keeping the confusability introduced by the extra phonemic variation low. However, this confusability is not easily correlated with the ASR performance, as it is an inherent phenomenon of speech. This paper proposes a method to construct a multiple pronunciation lexicon with a high discriminability. To do so, a phoneme confusion model is used to expand the phonemic search space of pronunciation variants during ASR decoding and a discriminative framework is adopted for the training of the weights of the phoneme confusions. For the parameter estimation, two training algorithms are implemented, the perceptron and the CRF model, using finite state transducers. Experiments on English data were conducted using a large stateof- the-art ASR system of continuous speech.


doi: 10.21437/Interspeech.2013-468

Cite as: Karanasou, P., Yvon, F., Lavergne, T., Lamel, L. (2013) Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR. Proc. Interspeech 2013, 1966-1970, doi: 10.21437/Interspeech.2013-468

@inproceedings{karanasou13_interspeech,
  author={Penny Karanasou and Fran├žois Yvon and Thomas Lavergne and Lori Lamel},
  title={{Discriminative training of a phoneme confusion model for a dynamic lexicon in ASR}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={1966--1970},
  doi={10.21437/Interspeech.2013-468}
}