Learning Personalized Pronunciations for Contact Name Recognition

Antoine Bruguier, Fuchun Peng, Françoise Beaufays


Automatic speech recognition of people’s names is difficult because names follow a long-tail distribution and have no commonly accepted spelling or pronunciation. This poses significant challenges for contact dialing by voice. We propose personalized pronunciation learning, which lets people use their own pronunciations for their contact names. We achieve this by implicitly learning from users’ corrections and, within minutes, making the learned pronunciation available for the next voice-dialing request. We show that personalized pronunciations significantly reduce word error on difficult contact names, by 15% relative.


DOI: 10.21437/Interspeech.2016-537

Cite as

Bruguier, A., Peng, F., Beaufays, F. (2016) Learning Personalized Pronunciations for Contact Name Recognition. Proc. Interspeech 2016, 3096-3100.

Bibtex
@inproceedings{Bruguier+2016,
author={Antoine Bruguier and Fuchun Peng and Françoise Beaufays},
title={Learning Personalized Pronunciations for Contact Name Recognition},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-537},
url={http://dx.doi.org/10.21437/Interspeech.2016-537},
pages={3096--3100}
}