ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Comparison of HMM experts with MLP experts in the full combination multi-band approach to robust ASR

Astrid Hagen, Andrew Morris

In this paper we apply the Full Combination (FC) multi-band approach, which has originally been introduced in the framework of posterior-based HMM/ANN (Hidden Markov Model/Artificial Neural Network) hybrid systems, to systems in which the ANN (or Multilayer Perceptron (MLP)) is itself replaced by a Multi Gaussian HMM (MGM). Both systems represent the most widely used statistical models for robust ASR (automatic speech recognition). It is shown how the FC formula for the likelihood-based MGMs can easily be derived from the posterior-based approach by simply applying BayesÂ’ Rule. The experiments show that the Full Combination multi-band system with MGM experts performs better, in all noise conditions tested, than the simple sum and product rules which are normally used. As compared to the baseline full-band system, the FC system shows increased robustness mainly on band-limited noise. The goal of this article is not a performance comparison between Multilayer Perceptrons and Multi Gaussian Models but between the theory of the two approaches, posterior-based vs. likelihood-based FC approach, so results are only given for the MGMs.


Cite as: Hagen, A., Morris, A. (2000) Comparison of HMM experts with MLP experts in the full combination multi-band approach to robust ASR. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 345-348

@inproceedings{hagen00_icslp,
  author={Astrid Hagen and Andrew Morris},
  title={{Comparison of HMM experts with MLP experts in the full combination multi-band approach to robust ASR}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 345-348}
}