Workshop on Spoken Language Processing

January 9-11, 2003
Tata Institute of Fundamental Research, Mumbai, India

A Comparison of Public-Domain Software Tools for Speech Recognition

K. Samudravijaya and Maria Barot

School of Technology and Computer Science, Tata Institute of Fundamental Research, Mumbai, India

HTK and Sphinx are two freely downloadable software packages capable of implementing a large-vocabulary, speaker-independent, continuous speech recognition system in any language. HTK has been in use by various groups for about a decade and has gone through the refinement cycles necessary for commercial software, whereas Sphinx was released about a year ago and is still undergoing development in a university environment. However, owing to certain advanced features and a license permitting unrestricted use, Sphinx appears to be the more attractive of the two. We compared the two packages by implementing a Hindi speech recognition system with each. Although the recognition accuracies of the two systems are comparable, we observe that the acoustic modeling of Sphinx is superior.



Bibliographic reference.  Samudravijaya, K. / Barot, Maria (2003): "A Comparison of Public-Domain Software Tools for Speech Recognition", In WSLP-2003, 125-131.