EUROSPEECH 2003 - INTERSPEECH 2003
8th European Conference on Speech Communication and Technology

Geneva, Switzerland
September 1-4, 2003

        

A Discriminative Decision Tree Learning Approach to Acoustic Modeling

Sheng Gao (1), Chin-Hui Lee (2)

(1) Institute for Infocomm Research, Singapore
(2) Georgia Institute of Technology, USA

The decision tree is a popular method to accomplish tying of the states of a set context dependent phone HMMs for efficient and effective training of the large acoustic models. A likelihood-based impurity function is commonly adopted. It is well known that maximizing likelihood does not result in the maximal separation between the distributions in the leaves of the tree. To improve robustness, a discriminative decision tree learning approach is proposed. It embeds the MCE-GPD formulation in defining the impurity function so that the discriminative information could be taken into account while optimizing the tree. We compare the proposed approach with the conventional tree building using a Mandarin syllable recognition task. Our preliminary results show that the separation between the divided subspaces in the tree nodes is clearly enhanced although there is a slight performance reduction.

Full Paper

Bibliographic reference.  Gao, Sheng / Lee, Chin-Hui (2003): "A discriminative decision tree learning approach to acoustic modeling", In EUROSPEECH-2003, 1833-1836.