ISCA Archive SLTU 2012
ISCA Archive SLTU 2012

Open and extendable speech recognition application architecture for mobile environments

Tanel Alumäe, Kaarel Kaljurand

This paper describes a cloud-based speech recognition architecture primarily intended for mobile environments. The system consists of a speech recognition server and a client for the Android mobile operating system. The system enables to implement Android’s Voice Input functionality for languages that are not supported by the default Google implementation. The architecture supports both large vocabulary speech recognition as well as grammar-based recognition, where grammars can be implemented in JSGF or Grammatical Framework. The system is open source and easily extendable. We used the architecture to implement Estonian speech recognition for Android.

Index Terms: Speech recognition, CMU Sphinx, mobile devices, Android, open source, Estonian, Grammatical Framework


Cite as: Alumäe, T., Kaljurand, K. (2012) Open and extendable speech recognition application architecture for mobile environments. Proc. 3rd Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2012), 15-18

@inproceedings{alumae12_sltu,
  author={Tanel Alumäe and Kaarel Kaljurand},
  title={{Open and extendable speech recognition application architecture for mobile environments}},
  year=2012,
  booktitle={Proc. 3rd Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2012)},
  pages={15--18}
}