Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Error Correction Translation Using Text Corpora

Kai Ishikawa, Eiichiro Sumita

ATR Interpreting Telecommunications Research Laboratories, Seika-cho, Soraku-gun, Kyoto, Japan

In this paper, we propose an error correction method using text corpora. In this method, recognition errors are corrected using phonetically similar examples in the text corpora. The reliability of the correction hypotheses are judged according to their semantic consistency and their phonetic similarity to the original input. We previously proposed an error correction method that uses a treebank [1]. However, the previous method was not flexible in its use of examples, because structural mismatches occurred between the input and examples due to recognition errors. In our new proposal, examples are treated as morpheme sequences. This enables us to use examples partially when there are no useful full-sentence-examples. We built our proposed method into a speech translation system and compared the translation quality for simple translation and translation with error correction. The rate of acceptable translation increased about 10% with our proposed method compared to simple translation.

Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Ishikawa, Kai / Sumita, Eiichiro (1999): "Error correction translation using text corpora", In EUROSPEECH'99, 1995-1998.