ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Data-driven lexical modeling of pronunciation variations for ASR

Qian Yang, Jean-Pierre Martens

In this paper a method for the automatic construction of a lexicon with multiple entries per word is described. The basic idea is to transform a reference word transcription by means of stochastic pronunciation rules that can be learned automatically. This approach already proved its potential (Cremelie & Martens, 1999), and is now brought to a much higher level of performance. Relative reductions of the word error rate (WER) of 20 % (open vocabulary) to 45 % (closed vocabulary) are now within reach.

N. Cremelie, J.P. Martens. "In search for better pronunciation models for speech recognition," Speech Communication 29, 115-136, 1999.


doi: 10.21437/ICSLP.2000-103

Cite as: Yang, Q., Martens, J.-P. (2000) Data-driven lexical modeling of pronunciation variations for ASR. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 417-420, doi: 10.21437/ICSLP.2000-103

@inproceedings{yang00c_icslp,
  author={Qian Yang and Jean-Pierre Martens},
  title={{Data-driven lexical modeling of pronunciation variations for ASR}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 417-420},
  doi={10.21437/ICSLP.2000-103}
}