Interspeech'2005 - Eurospeech
While the temporal dynamics of speech can be represented very efficiently by Hidden Markov Models (HMMs), the classification of speech into single speech units (phonemes) is usually done with Gaussian mixture models which do not discriminate well. Here, we use Support Vector Machines (SVMs) for classification by integrating this method in a HMM-based speech recognition system. In this hybrid SVM/HMM system we translate the outputs of the SVM classifiers into conditional probabilities and use them as emission probabilities in a HMM-based decoder. SVMs are very appealing due to their association with statistical learning theory. They have already shown very good classification results in other fields of pattern recognition. We train and test our hybrid system on the DARPA Resource Management (RM1) corpus. Our results show better performance than HMM-based decoder using Gaussian mixtures.
Bibliographic reference. Krüger, Sven E. / Schafföner, Martin / Katz, Marcel / Andelic, Edin / Wendemuth, Andreas (2005): "Speech recognition with support vector machines in a hybrid system", In INTERSPEECH-2005, 993-996.