This paper describes a segmental feature extraction and speech coding method in an acoustic-articulatory domain using nomograms that represent a mapping between formant frequencies and articulatory parameters. The vocal tract model is a modified Fant model, in which we newly introduced a parameter for successively adjusting vocal tract lengths. We investigated first the relationship between formant contours and those of articulatory parameters and found the effectiveness of the articulatory domain for organizing acoustic-phonetic features with little dependency upon languages. Next, we applied the method to the low bit rate coder and confirmed that good quality speech synthesis was achieved in the condition of 18 bit used for articulatory code words.
Cite as: Ohmura, H., Tanaka, K. (1999) Segmental feature extraction and coding for speech synthesis. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 1471-1474, doi: 10.21437/Eurospeech.1999-334
@inproceedings{ohmura99_eurospeech, author={H. Ohmura and K. Tanaka}, title={{Segmental feature extraction and coding for speech synthesis}}, year=1999, booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)}, pages={1471--1474}, doi={10.21437/Eurospeech.1999-334} }