Developing Pronunciation Models in New Languages Faster by Exploiting Common Grapheme-to-Phoneme Correspondences Across Languages

Harry Bleyan, Sandy Ritchie, Jonas Fromseier Mortensen, Daan van Esch


We discuss two methods that let us easily create grapheme-to-phoneme (G2P) conversion systems for languages without any human-curated pronunciation lexicons, as long as we know the phoneme inventory of the target language and as long as we have some pronunciation lexicons for other languages written in the same script. We use these resources to infer what grapheme-to-phoneme correspondences we would expect, and predict pronunciations for words in the target language with minimal or no language-specific human work. Our first approach uses finite-state transducers, while our second approach uses a sequence-to-sequence neural network. Our G2P models reach high degrees of accuracy, and can be used for various applications, e.g. in developing an automatic speech recognition system. Our methods greatly simplify a task that has historically required extensive manual labor.


 DOI: 10.21437/Interspeech.2019-1781

Cite as: Bleyan, H., Ritchie, S., Mortensen, J.F., Esch, D.V. (2019) Developing Pronunciation Models in New Languages Faster by Exploiting Common Grapheme-to-Phoneme Correspondences Across Languages. Proc. Interspeech 2019, 2100-2104, DOI: 10.21437/Interspeech.2019-1781.


@inproceedings{Bleyan2019,
  author={Harry Bleyan and Sandy Ritchie and Jonas Fromseier Mortensen and Daan van Esch},
  title={{Developing Pronunciation Models in New Languages Faster by Exploiting Common Grapheme-to-Phoneme Correspondences Across Languages}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={2100--2104},
  doi={10.21437/Interspeech.2019-1781},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1781}
}