The paper describes a formant-based speaker normalization method suitable for speech visualization and articulation training systems. The method estimates an error function from the speaker's formant characteristics for a given vowel. The estimated error function indicates how to shift the critical-band filters on the mel-warped frequency scale. The paper also describes an accurate technique for formant tracking.
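As a rough illustration of shifting critical-band filters on a mel-warped scale, the sketch below measures how far a speaker's formants for one vowel lie from a reference set and moves the filter centres by that amount. It is not the paper's algorithm: the abstract does not define the error function, so the mean mel offset used here is an assumption, as are the function names, the common `2595 * log10(1 + f/700)` mel formula, the evenly spaced filter layout, and the formant values in the usage lines.

```python
import math


def hz_to_mel(f_hz):
    """Convert Hz to mel using the common 2595*log10(1 + f/700) formula (assumed here)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def formant_error_mel(speaker_formants_hz, reference_formants_hz):
    """Hypothetical error measure: mean mel offset of speaker formants from the reference."""
    diffs = [
        hz_to_mel(s) - hz_to_mel(r)
        for s, r in zip(speaker_formants_hz, reference_formants_hz)
    ]
    return sum(diffs) / len(diffs)


def shifted_filter_centers_hz(n_filters, f_low_hz, f_high_hz, shift_mel):
    """Centre frequencies (Hz) of filters evenly spaced in mel, moved by shift_mel."""
    lo = hz_to_mel(f_low_hz)
    hi = hz_to_mel(f_high_hz)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + step * (i + 1) + shift_mel) for i in range(n_filters)]


# Illustrative formant values for one vowel (not taken from the paper).
reference = [730.0, 1090.0, 2440.0]
speaker = [850.0, 1220.0, 2810.0]

shift = formant_error_mel(speaker, reference)
centers = shifted_filter_centers_hz(20, 100.0, 4000.0, shift)
```

Here the filter bank is moved toward the speaker's formant positions, so a speaker with higher formants than the reference produces a positive `shift` and higher filter centres. Applying a single global shift is the simplest choice; a per-formant or frequency-dependent shift would be an equally plausible reading of the abstract.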
Cite as: Ogner, M., Kacic, Z. (1999) Speaker normalization for audio-visual articulation training. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 579-582, doi: 10.21437/Eurospeech.1999-149
@inproceedings{ogner99_eurospeech,
  author={Marcel Ogner and Zdravko Kacic},
  title={{Speaker normalization for audio-visual articulation training}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={579--582},
  doi={10.21437/Eurospeech.1999-149}
}