Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

Mandarin Telephone Speech Recognition Using MCE/GPD-Based Speaker Cluster HMM

Sen-Chia Chang, Shih-Chieh Chien, Woei-Chyang Shieh

E000/CCL Industrial Technology Research Institute, Chutung, Hsinchu, Taiwan

In this paper we successfully apply the MCE/GPD method to train speaker cluster HMM. The essential concept of our approach is to adjust all the parameters of the speaker cluster HMM simultaneously using each utterance of the whole training set. In other words, the parameters of each cluster-dependent HMM are no longer independently estimated by using only the training data of the speakers who belong to its corresponding cluster. To achieve this purpose, the discriminant function used in the MCE/GPD method need to be defined by the parameter set of the entire speaker cluster HMM. In our implementation, we define it as a function of the log-likelihood scores given the cluster-dependent HMMs. The proposed discriminative training procedure would increase the cluster separability and then improve the recognition rate.

Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Chang, Sen-Chia / Chien, Shih-Chieh / Shieh, Woei-Chyang (1999): "Mandarin telephone speech recognition using MCE/GPD-based speaker cluster HMM", In EUROSPEECH'99, 2709-2712.