Sixth European Conference on Speech Communication and Technology

Budapest, Hungary
September 5-9, 1999

A New Hybrid Structure of Speech Recognizer Based on HMM and Neural Network

Jianlai Zhou, Xiaodong He, Tiecheng Yu, Fuyuan Mo

Speech Processing Laboratory, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China

In this paper, we introduced a new framework of speech recognizer based on HMM and neural net. Unlike the traditional hybrid system, the neural net was used as a post processor, which classify the speech data segmented by HMM recognizer. The purpose of this method is to improve the top-choice accuracy of HMM based speech recognition system in our lab. Major issues such as how to use the segmentation information of HMM in neural net, the structure of the neural net, the choice of the error metric for training neural net, and the determination of the training procedure are investigated within a set of experiments. In these experiments, we attempt to recognize 68 phoneme like units in continuous speech. Our results indicate that this is a potential method. About 20% can be obtained to improve the recognition accuracy for multi-speaker system in syllable level, and 10% for speaker independent system.

Full Paper (PDF)   Gnu-Zipped Postscript

Bibliographic reference.  Zhou, Jianlai / He, Xiaodong / Yu, Tiecheng / Mo, Fuyuan (1999): "A new hybrid structure of speech recognizer based on HMM and neural network", In EUROSPEECH'99, 1131-1134.