10th Annual Conference of the International Speech Communication Association

Brighton, United Kingdom
September 6-10, 2009

Very Large Vocabulary Voice Dictation for Mobile Devices

Jan Nouza, Petr Cerva, Jindrich Zdansky

Technical University of Liberec, Czech Republic

This paper deals with optimization techniques that can make very large vocabulary voice dictation applications deployable on recent mobile devices. We focus namely on optimization of signal parameterization (frame rate, FFT calculation, fixed-point representation) and on efficient pruning techniques employed on the state and Gaussian mixture level. We demonstrate the applicability of the proposed techniques on the practical design of an embedded 255K-word discrete dictation program developed for Czech. Its real performance is comparable to a client-server version of the fluent dictation program implemented on the same mobile device.

Full Paper

Bibliographic reference.  Nouza, Jan / Cerva, Petr / Zdansky, Jindrich (2009): "Very large vocabulary voice dictation for mobile devices", In INTERSPEECH-2009, 995-998.