ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

Evaluation of a spoken dialogue system with usability tests and long-term pilot studies: similarities and differences

Markku Turunen, Jaakko Hakulinen, Anssi Kainulainen

We present findings from the long-term study of a speech-based bus timetable system. After the deployment of the prototype system we have collected data from real usage for 30 months. In addition, we have conducted usability tests to get subjective ratings of the pilot system. The comparison of these evaluations shows that the results obtained with usability tests differ significantly from those gained from the real usage, and the data of the initial use differs significantly from the data collected after that. For example, the differences in help requests, interruptions, speech recognition rejections, silence timeouts, and repeat requests are highly significant, and in some cases, such as explicit quit requests, enormous (65% versus 3%).


doi: 10.21437/Interspeech.2006-158

Cite as: Turunen, M., Hakulinen, J., Kainulainen, A. (2006) Evaluation of a spoken dialogue system with usability tests and long-term pilot studies: similarities and differences. Proc. Interspeech 2006, paper 1978-Tue2A3O.4, doi: 10.21437/Interspeech.2006-158

@inproceedings{turunen06b_interspeech,
  author={Markku Turunen and Jaakko Hakulinen and Anssi Kainulainen},
  title={{Evaluation of a spoken dialogue system with usability tests and long-term pilot studies: similarities and differences}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1978-Tue2A3O.4},
  doi={10.21437/Interspeech.2006-158}
}