ISCA Archive Eurospeech 1999

On combining vocal tract length normalisation and speaker adaptation for noise robust speech recognition

Ramalingam Hariharan, Olli Viikki

This paper investigates the combination of vocal tract length normalisation and speaker adaptation in connected digit recognition. In particular, we focus on performing this task in a continuously varying car noise environment. Continuous supervised speaker and environment adaptation is carried out on the test data within the Bayesian framework. The paper also evaluates various approaches to implementing vocal tract length normalisation. The best performance was obtained when normalisation was performed during both initial speaker-independent training and testing. It was also found that, during testing, speaker-specific normalisation produced better results than utterance-specific normalisation. Our experimental results on the connected digit database show that the joint approach outperforms a system in which only on-line Bayesian speaker adaptation is performed on the HMM mean parameters. The performance gain was particularly high for so-called outlier speakers, for whom adaptation is truly needed.
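
To illustrate the difference between utterance-specific and speaker-specific normalisation mentioned above, the following is a minimal sketch, not the authors' implementation. It assumes that warp factors are chosen by a maximum-likelihood grid search, a common choice that the abstract does not confirm, and that a per-utterance log-likelihood has already been computed for each candidate warp factor. The function names and the score values are hypothetical.

```python
from typing import Dict, List, Sequence

# Assumption: each utterance maps candidate warp factors to the
# log-likelihood obtained when its features are warped by that factor.
Scores = Dict[float, float]


def pick_warp(loglik_by_alpha: Scores) -> float:
    """Return the candidate warp factor with the highest log-likelihood."""
    return max(loglik_by_alpha, key=loglik_by_alpha.get)


def utterance_specific(scores: Sequence[Scores]) -> List[float]:
    """Choose one warp factor independently for every utterance."""
    return [pick_warp(s) for s in scores]


def speaker_specific(scores: Sequence[Scores]) -> float:
    """Choose a single warp factor for the speaker by summing the
    log-likelihoods of all of that speaker's utterances."""
    total: Scores = {}
    for s in scores:
        for alpha, loglik in s.items():
            total[alpha] = total.get(alpha, 0.0) + loglik
    return pick_warp(total)


# Hypothetical scores for three utterances from one speaker.
example_scores = [
    {0.9: -10.0, 1.0: -8.0, 1.1: -9.0},
    {0.9: -7.0, 1.0: -7.5, 1.1: -12.0},
    {0.9: -9.0, 1.0: -8.5, 1.1: -8.9},
]

per_utterance = utterance_specific(example_scores)
per_speaker = speaker_specific(example_scores)
```

In this sketch the speaker-specific estimate pools evidence over all utterances, so a single short or noisy utterance has less influence on the chosen factor than it does in the utterance-specific case.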


doi: 10.21437/Eurospeech.1999-57

Cite as: Hariharan, R., Viikki, O. (1999) On combining vocal tract length normalisation and speaker adaptation for noise robust speech recognition. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 215-218, doi: 10.21437/Eurospeech.1999-57

@inproceedings{hariharan99_eurospeech,
  author={Ramalingam Hariharan and Olli Viikki},
  title={{On combining vocal tract length normalisation and speaker adaptation for noise robust speech recognition}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={215--218},
  doi={10.21437/Eurospeech.1999-57}
}