ISCA Archive Interspeech 2005

A study of implicit and explicit modeling of coarticulation and pronunciation variation

Stéphane Dupont, Christophe Ris, Laurent Couvreur, Jean-Marc Boite

In this paper, we focus on the modeling of coarticulation and pronunciation variation in Automatic Speech Recognition (ASR) systems. Most ASR systems describe these production phenomena explicitly, through context-dependent phoneme models and multiple-pronunciation lexicons.

Here, we explore the potential benefit of feature spaces covering longer time segments for the implicit modeling of coarticulation and pronunciation variants.

The study is based on a phonetic-level analysis of the performance of context-independent and context-dependent acoustic models. In particular, it examines how modeling temporal contexts of different lengths, ranging from 70 ms up to 310 ms, affects typical cases of pronunciation variants.

The results, confirmed by a word recognition experiment, bring to light some ability of generic acoustic models to handle pronunciation variation implicitly.


doi: 10.21437/Interspeech.2005-490

Cite as: Dupont, S., Ris, C., Couvreur, L., Boite, J.-M. (2005) A study of implicit and explicit modeling of coarticulation and pronunciation variation. Proc. Interspeech 2005, 1353-1356, doi: 10.21437/Interspeech.2005-490

@inproceedings{dupont05_interspeech,
  author={Stéphane Dupont and Christophe Ris and Laurent Couvreur and Jean-Marc Boite},
  title={{A study of implicit and explicit modeling of coarticulation and pronunciation variation}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={1353--1356},
  doi={10.21437/Interspeech.2005-490}
}