ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Speaker adaptation using regularization and network adaptation for hybrid MMI-NN/HMM speech recognition

Jörg Rottland, Christoph Neukirchen, Daniel Willett, Gerhard Rigoll

This paper describes, how to perform speaker adaptation for a hybrid large vocabulary speech recognition system. The hybrid system is based on a Maximum Mutual Information Neural Net-work (MMINN), which is used as a Vector Quantizer (VQ) for a discrete HMM speech recognizer. The combination of MMINNs and HMMs has shown good performance on several large vocabulary speech recognition tasks like RM and WSJ. This paper now presents two approaches to perform speaker adaptation with this hybrid system. The first approach is a trans-formation of the feature space, which is performed by a neural network with maximum likelihood (ML) as objective function for the complete system, which means, that the parameters of the NN are estimated in order to match the HMM-parameters of the pretrained speaker independent system. The second approach is to adapt the HMM parameters depending on the amount of training data available per HMM, using a regularization approach. Both approaches can be applied jointly, which further improves the recognition accuracy.


doi: 10.21437/Eurospeech.1999-58

Cite as: Rottland, J., Neukirchen, C., Willett, D., Rigoll, G. (1999) Speaker adaptation using regularization and network adaptation for hybrid MMI-NN/HMM speech recognition. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 219-222, doi: 10.21437/Eurospeech.1999-58

@inproceedings{rottland99_eurospeech,
  author={Jörg Rottland and Christoph Neukirchen and Daniel Willett and Gerhard Rigoll},
  title={{Speaker adaptation using regularization and network adaptation for hybrid MMI-NN/HMM speech recognition}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={219--222},
  doi={10.21437/Eurospeech.1999-58}
}