Evaluation of TTS systems is essential to assess performance. The ITUT P.85 standard was introduced in 1994 to assess the overall quality of speech synthesis systems. However it has not been widely accepted or used. This paper compares the ITU test to more commonly used tests for intelligibility (semantically unpredictable sentences (SUS)) and naturalness (mean opinion score based). The aim of this research was to determine if the ITU test can provide a better performance measure and/or supplementary information to help evaluate TTS systems.
Cite as: Sityaev, D., Knill, K., Burrows, T. (2006) Comparison of the ITU-t p.85 standard to other methods for the evaluation of text-to-speech systems. Proc. Interspeech 2006, paper 1233-Tue2WeO.3, doi: 10.21437/Interspeech.2006-54
@inproceedings{sityaev06_interspeech, author={Dmitry Sityaev and Katherine Knill and Tina Burrows}, title={{Comparison of the ITU-t p.85 standard to other methods for the evaluation of text-to-speech systems}}, year=2006, booktitle={Proc. Interspeech 2006}, pages={paper 1233-Tue2WeO.3}, doi={10.21437/Interspeech.2006-54} }