ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Improved lexicon formation through removal of co-articulation and acoustic recognition errors

Philip Hanna, Darryl Stewart, Ji Ming, F. Jack Smith

It is becoming increasingly more necessary that speech recognition systems contain an accurate lexicon, consisting of likely word pronunciations that actually occur within a given domain. Given the increasing size of speech databases, it would appear that data driven approaches are best suited to derive such pronunciations. Presently, however, such an approach often introduces implausible pronunciations, resulting in a higher degree of confusability within the decoder. In this paper, we outline a novel data driven approach which aims to improve the quality of extracted word pronunciations through the removal of co-articulation effects and acoustic model misclassifications from the speech data. A number of selection constraints are additionally employed to exclude any improbable pronunciation alternatives. Initial experiments have shown that the approach does indeed provide plausible pronunciation alternatives without introducing improbable pronunciations.


Cite as: Hanna, P., Stewart, D., Ming, J., Smith, F.J. (2000) Improved lexicon formation through removal of co-articulation and acoustic recognition errors. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 50-53

@inproceedings{hanna00_icslp,
  author={Philip Hanna and Darryl Stewart and Ji Ming and F. Jack Smith},
  title={{Improved lexicon formation through removal of co-articulation and acoustic recognition errors}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 50-53}
}