11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30, 2010

An Implementation of Decision Tree-Based Context Clustering on Graphics Processing Units

Nicholas Pilkington, Heiga Zen

Toshiba Research Europe Ltd., UK

Decision tree-based context clustering is an essential but time-consuming step in building HMM-based speech synthesis systems. It clusters HMM states (or streams) according to their contexts so as to maximize the log likelihood of the training data given the model. The widely used implementation is not designed to take advantage of highly parallel architectures such as GPUs. This paper presents an implementation of tree-based clustering for such architectures. Experimental results show that the new implementation running on GPUs was an order of magnitude faster than the conventional one running on CPUs.
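
To make the clustering criterion concrete, the following is a minimal sketch of how one tree node might score candidate context questions by log-likelihood gain. It is not the authors' implementation: it assumes single one-dimensional Gaussians per state, and the names `pooled_log_likelihood`, `split_gain`, `best_question`, the question labels, and the example statistics are all hypothetical. Each question is scored independently of the others, which is the kind of work that could plausibly be spread across many GPU threads; the sketch itself runs serially on a CPU.

```python
import math

def pooled_log_likelihood(stats):
    """Approximate log likelihood of a cluster modelled by one shared Gaussian.

    stats: list of (occupancy, mean, variance) tuples, one per HMM state.
    Assumption: 1-D Gaussians; real systems use multi-dimensional streams.
    """
    occ = sum(g for g, _, _ in stats)
    if occ <= 0.0:
        return 0.0
    mean = sum(g * m for g, m, _ in stats) / occ
    second_moment = sum(g * (v + m * m) for g, m, v in stats) / occ
    var = max(second_moment - mean * mean, 1e-10)  # floor to keep log finite
    return -0.5 * occ * (math.log(2.0 * math.pi) + math.log(var) + 1.0)

def split_gain(stats, answers):
    """Log-likelihood gain from splitting a node by one yes/no question.

    answers: list of booleans, one per state (True = state answers "yes").
    """
    yes = [s for s, a in zip(stats, answers) if a]
    no = [s for s, a in zip(stats, answers) if not a]
    if not yes or not no:
        return float("-inf")  # the question does not actually split the node
    return (pooled_log_likelihood(yes) + pooled_log_likelihood(no)
            - pooled_log_likelihood(stats))

def best_question(stats, questions):
    """Score every question independently and return the best (name, gain)."""
    gains = {name: split_gain(stats, ans) for name, ans in questions.items()}
    best = max(gains, key=gains.get)
    return best, gains[best]

if __name__ == "__main__":
    # Hypothetical statistics: two states near 0 and two states near 5.
    stats = [(10.0, 0.0, 1.0), (10.0, 0.1, 1.0),
             (10.0, 5.0, 1.0), (10.0, 5.1, 1.0)]
    # Hypothetical questions, given as per-state yes/no answers.
    questions = {
        "separates_by_mean": [True, True, False, False],
        "mixes_both_groups": [True, False, True, False],
    }
    name, gain = best_question(stats, questions)
    print(name, round(gain, 2))
```

In this sketch the question that separates the two groups of states yields the larger gain, so it would be chosen to split the node. Stopping criteria such as a minimum gain or minimum occupancy threshold are omitted here.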

Bibliographic reference. Pilkington, Nicholas / Zen, Heiga (2010): "An implementation of decision tree-based context clustering on graphics processing units", in INTERSPEECH-2010, 833-836.