This paper investigates the use of language identification (LID) in real-time speech-to-speech translation systems. We propose a framework that incorporates LID capability into a speech-tospeech translation system while minimizing the impact on the systems real-time performance. We compared two phone-based LID approaches, namely PRLM and PPRLM, to a proposed extended approach based on Conditional Random Field classifiers. The performances of these three approaches were evaluated to identify the input language in the CMU English-Iraqi TransTAC system, and the proposed approach obtained significantly higher classification accuracies on two of the three test sets evaluated.
Cite as: Lim, D.C.Y., Lane, I. (2009) Language identification for speech-to-speech translation. Proc. Interspeech 2009, 204-207, doi: 10.21437/Interspeech.2009-76
@inproceedings{lim09_interspeech, author={Daniel Chung Yong Lim and Ian Lane}, title={{Language identification for speech-to-speech translation}}, year=2009, booktitle={Proc. Interspeech 2009}, pages={204--207}, doi={10.21437/Interspeech.2009-76} }