In this paper we describe the RTTS system for enterprise-level real time speech recognition and translation. RTTS follows a Web Service-based approach which allows the encapsulation of ASR and MT Technology components thus hiding the configuration and tuning complexities and details from the client applications while exposing a uniform interface. In this way, RTTS is capable of easily supporting a wide variety of client applications. The clients we have implemented include a VoIP-based real time speech-to-speech translation system, a chat and Instant Messaging translation System, a Transcription Server, among others.
Cite as: Huerta, J.M., Wu, C., Sakrajda, A., Caskey, S., Jan, E.-E., Faisman, A., Ben-David, S., Liu, W., Lee, A., Stewart, O., Frissora, M., Lubensky, D. (2009) RTTS: towards enterprise-level real-time speech transcription and translation services. Proc. Interspeech 2009, 436-439, doi: 10.21437/Interspeech.2009-157
@inproceedings{huerta09_interspeech, author={Juan M. Huerta and Cheng Wu and Andrej Sakrajda and Sasha Caskey and Ea-Ee Jan and Alexander Faisman and Shai Ben-David and Wen Liu and Antonio Lee and Osamuyimen Stewart and Michael Frissora and David Lubensky}, title={{RTTS: towards enterprise-level real-time speech transcription and translation services}}, year=2009, booktitle={Proc. Interspeech 2009}, pages={436--439}, doi={10.21437/Interspeech.2009-157} }