ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Accurate and compact large vocabulary speech recognition on mobile devices

Xin Lei, Andrew Senior, Alexander Gruenstein, Jeffrey Sorensen

In this paper we describe the development of an accurate, smallfootprint, large vocabulary speech recognizer for mobile devices. To achieve the best recognition accuracy, state-of-the-art deep neural networks (DNNs) are adopted as acoustic models. A variety of speedup techniques for DNN score computation are used to enable real-time operation on mobile devices. To reduce the memory and disk usage, on-the-fly language model (LM) rescoring is performed with a compressed n-gram LM. We were able to build an accurate and compact system that runs well below real-time on a Nexus 4 Android phone.


doi: 10.21437/Interspeech.2013-189

Cite as: Lei, X., Senior, A., Gruenstein, A., Sorensen, J. (2013) Accurate and compact large vocabulary speech recognition on mobile devices. Proc. Interspeech 2013, 662-665, doi: 10.21437/Interspeech.2013-189

@inproceedings{lei13_interspeech,
  author={Xin Lei and Andrew Senior and Alexander Gruenstein and Jeffrey Sorensen},
  title={{Accurate and compact large vocabulary speech recognition on mobile devices}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={662--665},
  doi={10.21437/Interspeech.2013-189}
}