ISCA Archive ICSLP 2000

Pronunciation variation in ASR: which variation to model?

Mirjam Wester, Judith M. Kessens, Helmer Strik

This paper describes how the performance of a continuous speech recognizer for Dutch has been improved by modeling within-word and cross-word pronunciation variation. A relative improvement of 8.8% in word error rate (WER) was found compared to baseline system performance. However, because WERs do not reveal the full effect of modeling pronunciation variation, we performed a detailed analysis of the differences in recognition results that arise from modeling pronunciation variation, and found that many of these differences are indeed not reflected in the error rates. Furthermore, error analysis revealed that testing sets of variants in isolation does not predict their behavior in combination. However, these results appeared to be corpus dependent.


doi: 10.21437/ICSLP.2000-855

Cite as: Wester, M., Kessens, J.M., Strik, H. (2000) Pronunciation variation in ASR: which variation to model? Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 4, 488-491, doi: 10.21437/ICSLP.2000-855

@inproceedings{wester00b_icslp,
  author={Mirjam Wester and Judith M. Kessens and Helmer Strik},
  title={{Pronunciation variation in ASR: which variation to model?}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={4},
  pages={488--491},
  doi={10.21437/ICSLP.2000-855}
}