ISCA Archive ICSLP 2000

Multi-strategy data mining on Mandarin prosodic patterns

Yiqiang Chen, Wen Gao, Tingshao Zhu, Jiyong Ma

Mandarin prosodic models, which mainly describe the variation of pitch, are very important in speech research and synthesis. The models now used in most Chinese Text-To-Speech systems are constructed by experts, qualitatively and with low precision. In this paper, we propose a Multi-strategy Data Mining framework to extract prosodic patterns from a large database of actual Mandarin speech, in order to improve the naturalness and intelligibility of synthesized speech. In data preprocessing, typical prosody models are found by clustering analysis, and Rough Set is employed for feature selection. An ANN and a Decision Tree are trained separately. The prediction results of the ANN and the Decision Tree are integrated to generate fundamental frequency and energy contours. The experimental results showed that the synthesized prosodic features closely resembled their original counterparts for most syllables.


Cite as: Chen, Y., Gao, W., Zhu, T., Ma, J. (2000) Multi-strategy data mining on Mandarin prosodic patterns. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 2, 59-62

@inproceedings{chen00e_icslp,
  author={Yiqiang Chen and Wen Gao and Tingshao Zhu and Jiyong Ma},
  title={{Multi-strategy data mining on Mandarin prosodic patterns}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 2, 59-62}
}