ISCA Archive Interspeech 2005

A user study on the influence of mobile device class, synthesis method, data rate and lexicon on speech synthesis quality

Michael Pucher, Peter Fröhlich

In this paper, we report on a comparative user study of the quality of mobile speech synthesis methods. We measured the impact of device class, data rate, synthesis method (diphone vs. non-uniform unit selection) and lexicon usage on speech quality, assessed through word comprehension and several subjective satisfaction metrics. Seven practically relevant speech synthesis implementations and one natural voice were evaluated using the method recommended in ITU-T P.85, supplemented by pairwise comparisons. Overall, although the subjective ratings of the synthetic voices differed significantly, the word comprehension rates were quite similar. We found a significant impact of data rate and synthesis method on the mean subjective speech quality, but not on word comprehension. The use of a lexicon in embedded speech synthesis slightly improved the perceived pronunciation quality.


doi: 10.21437/Interspeech.2005-780

Cite as: Pucher, M., Fröhlich, P. (2005) A user study on the influence of mobile device class, synthesis method, data rate and lexicon on speech synthesis quality. Proc. Interspeech 2005, 2501-2504, doi: 10.21437/Interspeech.2005-780

@inproceedings{pucher05_interspeech,
  author={Michael Pucher and Peter Fröhlich},
  title={{A user study on the influence of mobile device class, synthesis method, data rate and lexicon on speech synthesis quality}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={2501--2504},
  doi={10.21437/Interspeech.2005-780}
}