12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

A Qualitative Evaluation of Phoneme-to-Phoneme Technology

Marijn Schraagen, Gerrit Bloothooft

Universiteit Utrecht, The Netherlands

Automatic speech recognition systems apply grapheme-to-phoneme (G2P) transcription to model the pronunciation of items in the lexicon. General-purpose G2P transcriptions are not always accurate, e.g., in a multilingual environment. To improve transcription quality, G2P transcriptions can be postprocessed using a phoneme-to-phoneme (P2P) converter. This paper discusses the applicability of P2P technology based on the results of a speech recognition experiment using P2P conversion on a multilingual speech corpus. P2P conversion can be applied successfully; however, the analysis also shows limitations of P2P technology.
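
As a rough illustration of the two-stage idea only (G2P output postprocessed by a P2P step), a minimal sketch follows. It is not the converter evaluated in the paper: the lexicon, the phoneme symbols, the rewrite rules, and the names `TOY_G2P`, `TOY_P2P_RULES`, `g2p`, and `p2p` are all hypothetical, and a real P2P converter would typically be learned from data rather than written by hand.

```python
from typing import Dict, List, Sequence, Tuple

Phonemes = List[str]
Rule = Tuple[Tuple[str, ...], Tuple[str, ...]]

# Hypothetical stand-in for a general-purpose G2P component:
# a toy lookup that yields a naive, letter-like transcription.
TOY_G2P: Dict[str, Phonemes] = {
    "jean": ["j", "e", "a", "n"],
}

# Hypothetical P2P rules: (source phoneme sequence, replacement sequence).
# The symbols are illustrative and not taken from the paper.
TOY_P2P_RULES: List[Rule] = [
    (("j",), ("Z",)),
    (("e", "a", "n"), ("A~",)),
]


def g2p(word: str) -> Phonemes:
    """Return the toy G2P transcription of a word (assumed to be in the lexicon)."""
    return list(TOY_G2P[word.lower()])


def p2p(phonemes: Sequence[str], rules: Sequence[Rule]) -> Phonemes:
    """Rewrite a phoneme sequence left to right, using the first rule that matches."""
    out: Phonemes = []
    i = 0
    while i < len(phonemes):
        for src, dst in rules:
            if tuple(phonemes[i:i + len(src)]) == src:
                out.extend(dst)
                i += len(src)
                break
        else:
            # No rule applies at this position: keep the phoneme unchanged.
            out.append(phonemes[i])
            i += 1
    return out


initial = g2p("jean")                     # general-purpose transcription
corrected = p2p(initial, TOY_P2P_RULES)   # postprocessed transcription
```

In this sketch the P2P step never sees the spelling, only the initial transcription, which is what distinguishes it from simply replacing the G2P component.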

Bibliographic reference. Schraagen, Marijn / Bloothooft, Gerrit (2011): "A qualitative evaluation of phoneme-to-phoneme technology", in INTERSPEECH-2011, 2321-2324.