webASR 2 — Improved Cloud Based Speech Technology

Thomas Hain, Jeremy Christian, Oscar Saz, Salil Deena, Madina Hasan, Raymond W.M. Ng, Rosanna Milner, Mortaza Doulaty, Yulan Liu


This paper presents the most recent developments of the webASR service (www.webasr.org), the world’s first web-based fully functioning automatic speech recognition platform for scientific use. Initially released in 2008, the functionalities of webASR have recently been expanded with 3 main goals in mind: Facilitate access through a RESTful architecture, that allows for easy use through either the web interface or an API; allow the use of input metadata when available by the user to improve system performance; and increase the coverage of available systems beyond speech recognition. Several new systems for transcription, diarisation, lightly supervised alignment and translation are currently available through webASR. The results in a series of well-known benchmarks (RT’09, IWSLT’12 and MGB’15 evaluations) show how these webASR systems provides state-of-the-art performances across these tasks.


DOI: 10.21437/Interspeech.2016-700

Cite as

Hain, T., Christian, J., Saz, O., Deena, S., Hasan, M., Ng, R.W., Milner, R., Doulaty, M., Liu, Y. (2016) webASR 2 — Improved Cloud Based Speech Technology. Proc. Interspeech 2016, 1613-1617.

Bibtex
@inproceedings{Hain+2016,
author={Thomas Hain and Jeremy Christian and Oscar Saz and Salil Deena and Madina Hasan and Raymond W.M. Ng and Rosanna Milner and Mortaza Doulaty and Yulan Liu},
title={webASR 2 — Improved Cloud Based Speech Technology},
year=2016,
booktitle={Interspeech 2016},
doi={10.21437/Interspeech.2016-700},
url={http://dx.doi.org/10.21437/Interspeech.2016-700},
pages={1613--1617}
}