Sage: The New BBN Speech Processing Platform

Roger Hsiao, Ralf Meermeier, Tim Ng, Zhongqiang Huang, Maxwell Jordan, Enoch Kan, Tanel Alumäe, Jan Silovsky, William Hartmann, Francis Keith, Omer Lang, Manhung Siu, Owen Kimball


To capitalize on the rapid development of Speech-to-Text (STT) technologies and the proliferation of open source machine learning toolkits, BBN has developed Sage, a new speech processing platform that integrates technologies from multiple sources, each of which has particular strengths. In this paper, we describe the design of Sage, which allows the easy interchange of STT components from different sources. We also describe our approach for fast prototyping with new machine learning toolkits, and a framework for sharing STT components across different applications. Finally, we report Sage’s state-of-the-art performance on different STT tasks.


DOI: 10.21437/Interspeech.2016-1031

Cite as

Hsiao, R., Meermeier, R., Ng, T., Huang, Z., Jordan, M., Kan, E., Alumäe, T., Silovsky, J., Hartmann, W., Keith, F., Lang, O., Siu, M., Kimball, O. (2016) Sage: The New BBN Speech Processing Platform. Proc. Interspeech 2016, 3022-3026.

Bibtex
@inproceedings{Hsiao+2016,
author={Roger Hsiao and Ralf Meermeier and Tim Ng and Zhongqiang Huang and Maxwell Jordan and Enoch Kan and Tanel Alumäe and Jan Silovsky and William Hartmann and Francis Keith and Omer Lang and Manhung Siu and Owen Kimball},
title={Sage: The New BBN Speech Processing Platform},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-1031},
url={http://dx.doi.org/10.21437/Interspeech.2016-1031},
pages={3022--3026}
}