Sixth International Conference on Spoken Language Processing (ICSLP 2000)

Beijing, China
October 16-20, 2000

Comparison of HMM Experts with MLP Experts in the Full Combination Multi-Band Approach to Robust ASR

Astrid Hagen, Andrew Morris

Dalle Molle Institute for Perceptual Artificial Intelligence (IDIAP), Martigny, Switzerland

In this paper we apply the Full Combination (FC) multi-band approach, which has originally been introduced in the framework of posterior-based HMM/ANN (Hidden Markov Model/Artificial Neural Network) hybrid systems, to systems in which the ANN (or Multilayer Perceptron (MLP)) is itself replaced by a Multi Gaussian HMM (MGM). Both systems represent the most widely used statistical models for robust ASR (automatic speech recognition). It is shown how the FC formula for the likelihood-based MGMs can easily be derived from the posterior-based approach by simply applying Bayes’ Rule. The experiments show that the Full Combination multi-band system with MGM experts performs better, in all noise conditions tested, than the simple sum and product rules which are normally used. As compared to the baseline full-band system, the FC system shows increased robustness mainly on band-limited noise. The goal of this article is not a performance comparison between Multilayer Perceptrons and Multi Gaussian Models but between the theory of the two approaches, posterior-based vs. likelihood-based FC approach, so results are only given for the MGMs.

Full Paper

Bibliographic reference.  Hagen, Astrid / Morris, Andrew (2000): "Comparison of HMM experts with MLP experts in the full combination multi-band approach to robust ASR", In ICSLP-2000, vol.1, 345-348.