ISCA Archive TAL 2006
ISCA Archive TAL 2006

The pause duration prediction for Mandarin text to speech system

Jianhua Tao, Jian Yu

In the paper, we enter into detailed analysis on how the pause duration under different prosodic boundaries are affected by various contextual factors in natural speech. To get the correlation between them, the paper calculates the mean pause duration under different prosodic boundaries. The contextual factors investigated in this paper contains both linguistic features, such as boundary types, syllable tones of boundary sides, initial and final types etc, and acoustic features, such as pitch gap across the boundary. The paper made experiments and discussions to reveal the influence of these factors on pause duration. Based on that, the paper creates a pause duration prediction model for mandarin speech synthesis system. The model was proved to be able to generate high quality prosody output with the listening test.


Cite as: Tao, J., Yu, J. (2006) The pause duration prediction for Mandarin text to speech system. Proc. 2nd International Symposium on Tonal Aspects of Languages (TAL 2006), 95-98

@inproceedings{tao06_tal,
  author={Jianhua Tao and Jian Yu},
  title={{The pause duration prediction for Mandarin text to speech system}},
  year=2006,
  booktitle={Proc. 2nd International Symposium on Tonal Aspects of Languages (TAL 2006)},
  pages={95--98}
}