8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

Data Pruning Approach to Unit Selection for Inventory Generation of Concatenative Embeddable Chinese TTS Systems

Zhenli Yu (1), Kaizhi Wang (2), Yiqing Zu (1), Dongjian Yue (1), Guilin Chen (1)

(1) Motorola Labs, China
(2) Shanghai Jiaotong University, China

In this paper, a data pruning approach is presented for building the acoustic unit inventory of a syllable-based concatenative embeddable Chinese TTS system. A three-portion segmentation of the syllable is proposed, based on the voiced/unvoiced structure of Chinese syllables. Individual acoustic factor measurements of each syllable are used to calculate a penalty for perceptually unsatisfactory concatenation. Based on the calculated penalties, bad syllables are removed from each cluster. The best syllable of each pruned cluster is then selected using a compromise acoustic measure. Evaluation and application results show that the method is promising, particularly for generating acoustic unit databases for small-footprint concatenative Chinese (Cantonese and Mandarin) TTS systems.
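The prune-then-select flow described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual acoustic factors, penalty formula, threshold, or weighting, so everything concrete here is an assumption made for illustration: the factor names (`duration_ms`, `pitch_hz`), the use of an absolute z-score within the cluster as the per-factor penalty, the pruning threshold of 1.5, and the equal-weight sum used as the compromise measure. The function names are hypothetical as well.

```python
from statistics import mean, pstdev


def factor_penalties(cluster):
    """Per-candidate, per-factor penalty.

    ASSUMPTION: the penalty is the absolute z-score of the factor within
    the cluster; the paper's real penalty function is not given in the abstract.
    """
    factors = list(cluster[0]["features"])
    stats = {}
    for f in factors:
        values = [c["features"][f] for c in cluster]
        # Fall back to 1.0 when all values are identical (zero spread).
        stats[f] = (mean(values), pstdev(values) or 1.0)
    return [
        {f: abs(c["features"][f] - stats[f][0]) / stats[f][1] for f in factors}
        for c in cluster
    ]


def prune_and_select(cluster, threshold=1.5, weights=None):
    """Remove high-penalty candidates, then pick one survivor.

    ASSUMPTIONS: a candidate is "bad" if any single factor penalty exceeds
    `threshold`; the compromise measure is a weighted sum of factor penalties.
    """
    penalties = factor_penalties(cluster)
    kept = [(c, p) for c, p in zip(cluster, penalties)
            if max(p.values()) <= threshold]
    if not kept:
        # Never leave a cluster empty: keep everything if all were pruned.
        kept = list(zip(cluster, penalties))
    weights = weights or {f: 1.0 for f in penalties[0]}
    best, _ = min(kept, key=lambda cp: sum(weights[f] * cp[1][f] for f in weights))
    return best["id"], [c["id"] for c, _ in kept]


# Hypothetical cluster: four recordings of the same syllable.
cluster = [
    {"id": "a", "features": {"duration_ms": 200, "pitch_hz": 120}},
    {"id": "b", "features": {"duration_ms": 210, "pitch_hz": 118}},
    {"id": "c", "features": {"duration_ms": 205, "pitch_hz": 121}},
    {"id": "d", "features": {"duration_ms": 400, "pitch_hz": 200}},  # outlier
]
best_id, kept_ids = prune_and_select(cluster)
print(best_id, kept_ids)
```

Separating the pruning step from the selection step mirrors the two stages named in the abstract: outliers are discarded factor by factor first, and only then is a single trade-off score used to choose among the remaining candidates.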


Bibliographic reference. Yu, Zhenli / Wang, Kaizhi / Zu, Yiqing / Yue, Dongjian / Chen, Guilin (2004): "Data pruning approach to unit selection for inventory generation of concatenative embeddable Chinese TTS systems", in INTERSPEECH-2004, 1177-1180.