ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Model-based speaker normalization methods for speech recognition

Masaki Naito, Li Deng, Yoshinori Sagisaka

We address the problem of how vocal-tract (VT) parameters and and the related VT geometric model can be used effectively to normalize the speech acoustic properties of the speakers. The problem is important since speaker variability is one major obstacle to high-accuracy speech recognition and use of VT parameters offers a natural way to account for such a variability. The data-driven methods used in the conventional technology for speaker adaptation requires a large amount of adaptation data, but our experimental results show the new model-based speaker normalization method described in this paper is superior in performance while drastically reducing the amount of adaptation data needed to normalize speakers.


doi: 10.21437/Eurospeech.1999-550

Cite as: Naito, M., Deng, L., Sagisaka, Y. (1999) Model-based speaker normalization methods for speech recognition. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2515-2518, doi: 10.21437/Eurospeech.1999-550

@inproceedings{naito99_eurospeech,
  author={Masaki Naito and Li Deng and Yoshinori Sagisaka},
  title={{Model-based speaker normalization methods for speech recognition}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2515--2518},
  doi={10.21437/Eurospeech.1999-550}
}