ISCA Archive PMLA 2002
ISCA Archive PMLA 2002

Pronunciation variation modeling for asr: large improvements are possible but small ones are likely

Qian Yang, Jean-Pierre Martens, Pieter-Jan Ghesquiere, Dirk Van Compernolle

In this paper a previously proposed method for the automatic construction of a lexicon with pronunciation variants for ASR is further developed and evaluated. The basic idea is to transform a lexicon of canonical forms by means of rewrite rules that are learned automatically on a training corpus of orthographically transcribed utterances. The method is evaluated on the TIMIT corpus, using a speech recognizer incorporating context-independent HMMs and a bigram language model. It appears that reductions of the word error rate of up to 35 % are possible to achieve. However, it also appears that it is more likely to obtain much lower gains.


Cite as: Yang, Q., Martens, J.-P., Ghesquiere, P.-J., Compernolle, D.V. (2002) Pronunciation variation modeling for asr: large improvements are possible but small ones are likely. Proc. ITRW on Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology (PMLA 2002), 123-128

@inproceedings{yang02_pmla,
  author={Qian Yang and Jean-Pierre Martens and Pieter-Jan Ghesquiere and Dirk Van Compernolle},
  title={{Pronunciation variation modeling for asr: large improvements are possible but small ones are likely}},
  year=2002,
  booktitle={Proc. ITRW on Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology (PMLA 2002)},
  pages={123--128}
}