An analysis-by-synthesis study of Mandarin speech prosody

Na Zhi, Daniel Hirst, Pier Marco Bertinetto, Aijun Li, Yuan Jia


In the present paper, an analysis by synthesis study of Mandarin speech prosody is carried out. The Mandarin prosodic features are discussed from two salient perspectives, specifically: the function of prosody and the form of prosody. The symbolic representation of prosodic form with the INTSINT (INternational Transcription System for INTonation) system [1] reduces the surface complexity of a prosodic contour to a simplified model, which contains the essential information expressing the functions of speech prosody. A proposed mapping rule between the representation of prosodic function and the representation of prosodic form is discussed and further evaluated in ProZed [2, 3, 4, 5] by generating synthesized utterances. It is suggested in the study that the synthesized Mandarin data derived from the prosodic coding of INTSINT symbols can not only closely mirror the melodic features of the original utterances, but also correctly express the prosodic functions of tones and the global intonation.


DOI: 10.21437/SpeechProsody.2016-22

Cite as

Zhi, N., Hirst, D., Bertinetto, P.M., Li, A., Jia, Y. (2016) An analysis-by-synthesis study of Mandarin speech prosody. Proc. Speech Prosody 2016, 104-108.

Bibtex
@inproceedings{Zhi+2016,
author={Na Zhi and Daniel Hirst and Pier Marco Bertinetto and Aijun Li and Yuan Jia},
title={An analysis-by-synthesis study of Mandarin speech prosody},
year=2016,
booktitle={Speech Prosody 2016},
doi={10.21437/SpeechProsody.2016-22},
url={http://dx.doi.org/10.21437/SpeechProsody.2016-22},
pages={104--108}
}