ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Identifying linguistic segmentations in Chinese spoken dialogue

Yue-Shi Lee, Hsin-Hsi Chen

In a continuous speech recognition system, a longer waveform is usually segmented into some shorter pieces based on simple acoustic criteria, such as unfilled pauses (i.e., silences). We call such a kind of segmentation as an acoustic segmentation. In general, the acoustic segmentations do not reflect the linguistic structure. They may fragment sentences or semantic units. Besides, they may also group together some unrelated units. Therefore, we need to resegment acoustic segmentations in order to output linguistically meaningful units such as clauses. We call such a kind of segmentation as a linguistic segmentation. This paper employs several acoustic and prosodic clues to resegment acoustic segmentations for identifying linguistic segmentations. Based on these clues, the experimental results show that a precision rate of 94.46% and a recall rate of 87.38% can be achieved.


doi: 10.21437/Eurospeech.1999-442

Cite as: Lee, Y.-S., Chen, H.-H. (1999) Identifying linguistic segmentations in Chinese spoken dialogue. Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999), 2003-2006, doi: 10.21437/Eurospeech.1999-442

@inproceedings{lee99b_eurospeech,
  author={Yue-Shi Lee and Hsin-Hsi Chen},
  title={{Identifying linguistic segmentations in Chinese spoken dialogue}},
  year=1999,
  booktitle={Proc. 6th European Conference on Speech Communication and Technology (Eurospeech 1999)},
  pages={2003--2006},
  doi={10.21437/Eurospeech.1999-442}
}