ISCA Archive Interspeech 2009

Eye tracking for the online evaluation of prosody in speech synthesis: not so fast!

Michael White, Rajakrishnan Rajkumar, Kiwako Ito, Shari R. Speer

This paper presents an eye-tracking experiment comparing the processing of different accent patterns in unit selection synthesis and human speech. The synthetic speech results failed to replicate the facilitative effect of contextually appropriate accent patterns found with human speech, while producing a more robust intonational garden-path effect with contextually inappropriate patterns, both of which could be due to processing delays seen with the synthetic speech. As the synthetic speech was of high quality, the results indicate that eye tracking holds promise as a highly sensitive and objective method for the online evaluation of prosody in speech synthesis.


doi: 10.21437/Interspeech.2009-665

Cite as: White, M., Rajkumar, R., Ito, K., Speer, S.R. (2009) Eye tracking for the online evaluation of prosody in speech synthesis: not so fast! Proc. Interspeech 2009, 2523-2526, doi: 10.21437/Interspeech.2009-665

@inproceedings{white09b_interspeech,
  author={Michael White and Rajakrishnan Rajkumar and Kiwako Ito and Shari R. Speer},
  title={{Eye tracking for the online evaluation of prosody in speech synthesis: not so fast!}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2523--2526},
  doi={10.21437/Interspeech.2009-665}
}