Recent Advances in the processing capabilities of Personal Digital Assistants (PDAs) have enabled the implementation of an end-to-end speech translation system on these devices. We have presented a bidirectional speech-to-speech (English and Chinese) translation system, which is hosted on a PDA running embedded Linux. Our Multilingual Automatic Speech-to-Speech Translator (MASTOR) system includes an HMM-based large vocabulary continuous speech recognizer using statistical n-grams, a translation module, and a multi-language speech synthesis system. This paper describes our recent efforts to develop a bi-directional English and colloquial Arabic speech-to-speech system on a device running the popular Windows-CE operating system. We have created completely new translation modules and introduced a fast adaptation scheme for a new speaker or environment. In addition, the componentization of system modules speeds up the development of a new speech-to-speech system.
Cite as: Zhu, W., Zhou, B., Prosser, C., Krbec, P., Gao, Y. (2006) Recent advances of IBM’s handheld speech translation system. Proc. Interspeech 2006, paper 1590-Tue2CaP.9, doi: 10.21437/Interspeech.2006-358
@inproceedings{zhu06_interspeech, author={Weizhong Zhu and Bowen Zhou and Charles Prosser and Pavel Krbec and Yuqing Gao}, title={{Recent advances of IBM’s handheld speech translation system}}, year=2006, booktitle={Proc. Interspeech 2006}, pages={paper 1590-Tue2CaP.9}, doi={10.21437/Interspeech.2006-358} }