ISCA Archive ICSLP 2000

A comparative study on acoustic and linguistic characteristics using speech from human-to-human and human-to-machine conversations

Toshiyuki Takezawa, Fumiaki Sugaya, Masaki Naito, Seiichi Yamamoto

Speech translation and dialogue systems must accept conversational speech. In this paper, we discuss the acoustic and linguistic characteristics of such speech, based on the results of speech recognition experiments using speech from human-to-human and human-to-machine conversations. Conversational speech input to machines consists of two kinds of expressions: frozen expressions, such as greetings and yes/no statements, and informative individual expressions, such as numerical data like dates and telephone numbers. The former have lower perplexity and acoustic characteristics close to those of spontaneous speech. The latter have higher perplexity and acoustic characteristics close to those of read speech. Each utterance or inter-pausal unit can be classified as one or the other. This finding should help future research on speech translation and dialogue systems.


Cite as: Takezawa, T., Sugaya, F., Naito, M., Yamamoto, S. (2000) A comparative study on acoustic and linguistic characteristics using speech from human-to-human and human-to-machine conversations. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 3, 522-525

@inproceedings{takezawa00_icslp,
  author={Toshiyuki Takezawa and Fumiaki Sugaya and Masaki Naito and Seiichi Yamamoto},
  title={{A comparative study on acoustic and linguistic characteristics using speech from human-to-human and human-to-machine conversations}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  volume={3},
  pages={522--525}
}