ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

Speech recognition with support vector machines in a hybrid system

Sven E. Krüger, Martin Schafföner, Marcel Katz, Edin Andelic, Andreas Wendemuth

While the temporal dynamics of speech can be represented very efficiently by Hidden Markov Models (HMMs), the classification of speech into single speech units (phonemes) is usually done with Gaussian mixture models which do not discriminate well. Here, we use Support Vector Machines (SVMs) for classification by integrating this method in a HMM-based speech recognition system. In this hybrid SVM/HMM system we translate the outputs of the SVM classifiers into conditional probabilities and use them as emission probabilities in a HMM-based decoder. SVMs are very appealing due to their association with statistical learning theory. They have already shown very good classification results in other fields of pattern recognition. We train and test our hybrid system on the DARPA Resource Management (RM1) corpus. Our results show better performance than HMM-based decoder using Gaussian mixtures.

doi: 10.21437/Interspeech.2005-237

Cite as: Krüger, S.E., Schafföner, M., Katz, M., Andelic, E., Wendemuth, A. (2005) Speech recognition with support vector machines in a hybrid system. Proc. Interspeech 2005, 993-996, doi: 10.21437/Interspeech.2005-237

  author={Sven E. Krüger and Martin Schafföner and Marcel Katz and Edin Andelic and Andreas Wendemuth},
  title={{Speech recognition with support vector machines in a hybrid system}},
  booktitle={Proc. Interspeech 2005},