ISCA Archive SSW 2004
ISCA Archive SSW 2004

Accurate spectral envelope estimation for articulation-to-speech synthesis

Yoshinori Shiga, Simon King

This paper introduces a novel articulatory-acoustic mapping in which detailed spectral envelopes are estimated based on the cepstrum, inclusive of the high-quefrency elements which are discarded in conventional speech synthesis to eliminate the pitch component of speech. For this estimation, the method deals with the harmonics of multiple voiced-speech spectra so that several sets of harmonics can be obtained at various pitch frequencies to form a spectral envelope. The experimental result shows that the method estimates spectral envelopes with the highest accuracy when the cepstral order is 48-64, which suggests that the higher order coeffcients are required to represent detailed envelopes reflecting the real vocal-tract responses.


Cite as: Shiga, Y., King, S. (2004) Accurate spectral envelope estimation for articulation-to-speech synthesis. Proc. 5th ISCA Workshop on Speech Synthesis (SSW 5), 19-24

@inproceedings{shiga04_ssw,
  author={Yoshinori Shiga and Simon King},
  title={{Accurate spectral envelope estimation for articulation-to-speech synthesis}},
  year=2004,
  booktitle={Proc. 5th ISCA Workshop on Speech Synthesis (SSW 5)},
  pages={19--24}
}