ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Efficient generation and use of MLP features for Arabic speech recognition

J. Park, F. Diehl, M. J. F. Gales, M. Tomalin, P. C. Woodland

Front-end features computed using Multi-Layer Perceptrons (MLPs) have recently attracted much interest, but are a challenge to scale to large networks and very large training data sets. This paper discusses methods to reduce the training time for the generation of MLP features and their use in an ASR system using a variety of techniques: parallel training of a set of MLPs on different data sub-sets; methods for computing features from by a combination of these networks; and rapid discriminative training of HMMs using MLP-based features. The impact on MLP frame-based accuracy using different training strategies is discussed along with the effect on word rates from incorporating the MLP features in various configurations into an Arabic broadcast audio transcription system.


doi: 10.21437/Interspeech.2009-84

Cite as: Park, J., Diehl, F., Gales, M.J.F., Tomalin, M., Woodland, P.C. (2009) Efficient generation and use of MLP features for Arabic speech recognition. Proc. Interspeech 2009, 236-239, doi: 10.21437/Interspeech.2009-84

@inproceedings{park09_interspeech,
  author={J. Park and F. Diehl and M. J. F. Gales and M. Tomalin and P. C. Woodland},
  title={{Efficient generation and use of MLP features for Arabic speech recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={236--239},
  doi={10.21437/Interspeech.2009-84}
}