ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

CASS: a phonetically transcribed corpus of mandarin spontaneous speech

Aijun Li, Fang Zheng, William Byrne, Pascale Fung, Terri Kamm, Yi Liu, Zhanjiang Song, Umar Ruhi, Veera Venkataramani, XiaoXia Chen

A collection of Chinese spoken language has been collected and phonetically annotated to capture spontaneous speech and language effects. The Chinese Annotated Spontaneous Speech (CASS) corpus contains phonetically transcribed spontaneous speech. This corpus was created to begin to collect samples of most of the phonetic variations in Mandarin spontaneous speech due to pronunciation effects, including allophonic changes, phoneme reduction, phoneme deletion and insertion, as well as duration changes. It is intended for use in pronunciation modeling for improved automatic speech recognition and will be used at the 2000 Johns Hopkins University Language Engineering Workshop by the project on Pronunciation Modeling ofMandarin Casual Speech.


Cite as: Li, A., Zheng, F., Byrne, W., Fung, P., Kamm, T., Liu, Y., Song, Z., Ruhi, U., Venkataramani, V., Chen, X. (2000) CASS: a phonetically transcribed corpus of mandarin spontaneous speech. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 485-488

@inproceedings{li00b_icslp,
  author={Aijun Li and Fang Zheng and William Byrne and Pascale Fung and Terri Kamm and Yi Liu and Zhanjiang Song and Umar Ruhi and Veera Venkataramani and XiaoXia Chen},
  title={{CASS: a phonetically transcribed corpus of mandarin spontaneous speech}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 485-488}
}