11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

Accelerating Hierarchical Acoustic Likelihood Computation on Graphics Processors

Pavel Květoň (1), Miroslav Novák (2)

(1) IBM Research, Czech Republic
(2) IBM T.J. Watson Research Center, USA

This paper presents a method for improving the performance of a speech recognition system by moving part of the computation, the acoustic likelihood computation, onto a Graphics Processing Unit (GPU). In the system, the GPU operates as a low-cost, powerful coprocessor for linear algebra operations. The paper compares GPU implementations of two techniques for acoustic likelihood computation: full Gaussian computation of all components, and a significantly faster Gaussian selection method using hierarchical evaluation. The full Gaussian computation is an ideal candidate for GPU implementation because it is essentially a matrix multiplication. The hierarchical Gaussian computation is a technique commonly used on CPUs, since it leads to much better performance by pruning the computation volume. Pruning techniques are generally much harder to implement on GPUs; nevertheless, the paper shows that hierarchical Gaussian computation can be implemented efficiently on GPUs.
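
The two techniques compared in the abstract can be illustrated with a minimal CPU-side sketch. It is not the authors' implementation: it assumes diagonal-covariance Gaussians, a two-level hierarchy with hand-picked cluster Gaussians, and a fixed top-k selection rule, none of which are specified in the abstract. All function names (gaussian_row, expand, full_scores, hierarchical_scores) are hypothetical. The sketch shows why the full computation reduces to a matrix product (each Gaussian becomes one row, each frame one expanded vector) and how pruning skips most rows.

```python
import math

def gaussian_row(mean, var):
    """Pack one diagonal Gaussian into a row [c, mean/var, -0.5/var]."""
    c = -0.5 * sum(math.log(2.0 * math.pi * v) + m * m / v
                   for m, v in zip(mean, var))
    return [c] + [m / v for m, v in zip(mean, var)] + [-0.5 / v for v in var]

def expand(x):
    """Extend a frame to [1, x, x^2]; row . expand(x) is the log-likelihood."""
    return [1.0] + list(x) + [xi * xi for xi in x]

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def full_scores(rows, x):
    """Full computation: every Gaussian row times the expanded frame.
    Over a batch of frames this is a single matrix-matrix product."""
    z = expand(x)
    return [dot(r, z) for r in rows]

def hierarchical_scores(cluster_rows, children, rows, x, top_k, floor=-1e30):
    """Assumed two-level scheme: score the cluster Gaussians first, then
    evaluate only the members of the top_k clusters; the rest get a floor."""
    z = expand(x)
    cluster_scores = [dot(r, z) for r in cluster_rows]
    best = sorted(range(len(cluster_rows)),
                  key=lambda i: cluster_scores[i], reverse=True)[:top_k]
    scores = [floor] * len(rows)
    for c in best:
        for g in children[c]:
            scores[g] = dot(rows[g], z)
    return scores

# Toy model (illustrative values only): four 2-D Gaussians in two clusters.
means = [[0.0, 0.0], [0.5, 0.0], [5.0, 5.0], [5.5, 5.0]]
variances = [[1.0, 1.0]] * 4
rows = [gaussian_row(m, v) for m, v in zip(means, variances)]
cluster_rows = [gaussian_row([0.25, 0.0], [2.0, 2.0]),
                gaussian_row([5.25, 5.0], [2.0, 2.0])]
children = [[0, 1], [2, 3]]

frame = [0.1, -0.2]
full = full_scores(rows, frame)
pruned = hierarchical_scores(cluster_rows, children, rows, frame, top_k=1)
```

In this toy setup the pruned pass evaluates two cluster rows and two member rows instead of all four Gaussians, and the best-scoring Gaussian is the same in both passes. On realistic model sizes the savings grow with the number of Gaussians per cluster, but the data-dependent selection step is what makes the method harder to map onto a GPU than the plain matrix product.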

Bibliographic reference.  Květoň, Pavel / Novák, Miroslav (2010): "Accelerating hierarchical acoustic likelihood computation on graphics processors", In INTERSPEECH-2010, 350-353.