Third International Conference on Spoken Language Processing (ICSLP 94)

Yokohama, Japan
September 18-22, 1994

Estimating Performance of Pipelined Spoken Language Translation Systems

Manny Rayner (1), David Carter (1), Patti Price (2), Bertil Lyberg (3)

(1) SRI International, Cambridge, UK
(2) SRI International, Menlo Park, CA, USA
(3) Telia Research AB, Haninge, Sweden

Most spoken language translation systems developed to date rely on a pipelined architecture, in which the main stages are speech recognition, linguistic analysis, transfer, generation and speech synthesis. When making projections of error rates for systems of this kind, it is natural to assume that the error rates for the individual components are independent, making the system accuracy the product of the component accuracies. The paper reports experiments carried out using the SRI-SICS-Telia Research Spoken Language Translator and a 1000-utterance sample of unseen data. The results suggest that the naive performance model leads to serious overestimates of system error rates, since there are in fact strong dependencies between the components. Predicting the system error rate on the independence assumption by simple multiplication resulted in a 16% proportional overestimate for all utterances, and a 19% overestimate when only utterances of length 1-10 words were considered.
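The naive performance model described in the abstract can be illustrated with a minimal sketch. The per-stage accuracies and the observed error rate below are hypothetical values chosen only for illustration; they are not figures from the paper. The function names are likewise assumptions, and "proportional overestimate" is interpreted here as (predicted - observed) / observed, which the abstract does not define explicitly.

```python
from math import prod

def predicted_system_error(component_accuracies):
    """Error rate predicted under the independence assumption:
    system accuracy is taken to be the product of the component accuracies."""
    return 1.0 - prod(component_accuracies)

def proportional_overestimate(predicted_error, observed_error):
    """Relative amount by which the predicted error exceeds the observed error
    (assumed definition, not confirmed by the source)."""
    return (predicted_error - observed_error) / observed_error

# Hypothetical per-stage accuracies for the five pipeline stages (not from the paper).
stage_accuracies = {
    "speech recognition": 0.90,
    "linguistic analysis": 0.95,
    "transfer": 0.95,
    "generation": 0.98,
    "speech synthesis": 0.99,
}

predicted = predicted_system_error(stage_accuracies.values())

# Hypothetical end-to-end error rate measured on a test sample (not from the paper).
observed = 0.18

print(f"predicted error under independence: {predicted:.3f}")
print(f"observed error: {observed:.3f}")
print(f"proportional overestimate: {proportional_overestimate(predicted, observed):.1%}")
```

When errors in different stages tend to fall on the same utterances, the observed end-to-end error rate is lower than the product model predicts, which is the kind of dependency the abstract reports.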


Bibliographic reference. Rayner, Manny / Carter, David / Price, Patti / Lyberg, Bertil (1994): "Estimating performance of pipelined spoken language translation systems", in ICSLP-1994, 1251-1254.