7th International Conference on Spoken Language Processing
September 16-20, 2002
This paper describes the evaluation methodology and results of the 2001 DARPA Communicator evaluation. The experiment spanned 6 months of 2001 and involved eight DARPA Communicator systems in the travel planning domain. It resulted in a corpus of 1242 dialogs which include many more dialogues for complex tasks than the 2000 evaluation. We describe the experimental design, the approach to data collection, and the results. We compare the results by the type of travel plan and by system. The results demonstrate some large differences across sites and show that the complex trips are clearly more difficult.
Bibliographic reference. Walker, Marilyn A. / Rudnicky, Alexander I. / Prasad, Rashmi / Aberdeen, John / Bratt, Elizabeth Owen / Garofolo, John S. / Hastie, Helen / Le, Audrey N. / Pellom, Bryan / Potamianos, Alex / Passonneau, Rebecca / Roukos, Salim / Sanders, Gregory A. / Seneff, Stephanie / Stallard, David (2002): "DARPA communicator: cross-system results for the 2001 evaluation", In ICSLP-2002, 269-272.