EUROSPEECH 2003 - INTERSPEECH 2003
Natural concept generation is critical to statistical interlingua-based speech translation performance. To improve maximum-entropy-based concept generation, a set of novel features and algorithms are proposed including features enabling model training on parallel corpora, employment of confidence thresholds and multiple sets of features. The concept generation error rate is reduced by 43%-50% in our speech translation corpus within limited domains. Improvements are also achieved in our experiments on speech-to-speech translation.
Bibliographic reference. Gu, Liang / Gao, Yuqing / Picheny, Michael (2003): "Improving statistical natural concept generation in interlingua-based speech-to-speech translation", In EUROSPEECH-2003, 2769-2772.