Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology (PMLA)

September 14-15, 2002
Aspen Lodge, Estes Park, Colorado, USA

A Novel Approach to Unsupervised Grapheme-to-Phoneme Conversion

Jerome R. Bellegarda

Spoken Language Group, Apple Computer, Cupertino, CA, USA

Automatic, data-driven grapheme-to-phoneme conversion is a challenging but often necessary task. The top-down strategy implicitly adopted by traditional inductive learning techniques tends to dismiss relevant contexts when they have been seen too infrequently in the training data. This paper proposes instead a bottom-up approach which, by design, exhibits better generalization properties. For each out-ofvocabulary word, a neighborhood of locally relevant pronunciations is constructed through latent semantic analysis of the appropriate graphemic form. Phoneme transcription then proceeds via locally optimal sequence alignment and maximum likelihood position scoring. This method was successfully applied to the speech synthesis of proper names with a large diversity of origin.


Full Paper

Bibliographic reference.  Bellegarda, Jerome R. (2002): "A novel approach to unsupervised grapheme-to-phoneme conversion", In PMLA-2002, 95-98.