Sixth International Conference on Spoken Language Processing
(ICSLP 2000)

Beijing, China
October 16-20, 2000

A Tool to Build a Treebank for Conversational Chinese

Yves Lepage, Nicolas Auclerc, Satoshi Shirai

ATR-SLT, Seika-tyo, Soraku-gun, Kyoto, Japan

N-grams with phonemes or words as basic units have been used extensively in speech recognition. Recently, it has been proposed to use n-grams with phrase tree structures as units to improve speech recognition quality. To test this idea on Chinese, a treebank of Chinese hotel reservation conversation utterances is needed. Because no such treebank is yet available, we have to build one. We propose to view the process of building a treebank as a sequence of editing and search operations:

This approach should have a beneficial "snowball" effect: the bigger the treebank, the faster and more consistent its extension.
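
As a rough illustration of treating treebank construction as alternating search and editing operations, the following minimal sketch keeps a toy treebank in a dictionary. Everything in it is an assumption made for illustration and is not taken from the paper: the function names `search` and `edit`, the bracketed tree notation, the example utterances, and the use of token-sequence similarity (Python's `difflib.SequenceMatcher`) as the search criterion.

```python
from difflib import SequenceMatcher


def search(treebank, tokens):
    """Return the (utterance, tree) pair most similar to `tokens`, or None.

    Similarity is the SequenceMatcher ratio over token sequences; this
    criterion is an assumption of the sketch, not the paper's method.
    """
    if not treebank:
        return None
    best = max(treebank, key=lambda u: SequenceMatcher(None, u, tokens).ratio())
    return best, treebank[best]


def edit(treebank, tokens, tree):
    """Store (or overwrite) the tree annotated for an utterance."""
    treebank[tuple(tokens)] = tree


# Hypothetical hotel-reservation utterances with hand-written bracketed trees.
treebank = {}
edit(treebank, ("我", "要", "订", "房间"),
     "(S (NP 我) (VP (V 要) (VP (V 订) (NP 房间))))")
edit(treebank, ("谢谢",), "(INTJ 谢谢)")

# A new utterance: search for the closest annotated one, then edit its tree
# by hand instead of annotating from scratch.
new_utterance = ("我", "要", "订", "单人间")
closest, template = search(treebank, new_utterance)
edit(treebank, new_utterance, template.replace("房间", "单人间"))
```

In this sketch, every stored tree becomes a candidate template for later utterances, which is one way the "snowball" effect could arise: a larger treebank makes a close match more likely, so each new tree needs fewer manual edits and stays closer to earlier annotations.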



Bibliographic reference.  Lepage, Yves / Auclerc, Nicolas / Shirai, Satoshi (2000): "A tool to build a treebank for conversational Chinese", In ICSLP-2000, vol.3, 985-988.