ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

Impact of speaking style and speaking task on acoustic models

Janienke Sturm, Hans Kamperman, Lou Boves, Els den Os

The loss in performance caused by mismatch between train and test material suggests a need for task specific acoustic models, especially for highly demanding tasks. However, since the training of these models is extremely expensive, general purpose models are more attractive. In this paper we address the impact of mismatch in speaking style and task. We trained three sets of acoustic models on data from different tasks, involving both read and extemporaneous speech. The average utterance length in the training corpora varied between 10.5 and 1.2 words. The models were tested on matched as well on very different tasks. The results suggest that general purpose models trained from short utterances are to be preferred in most spoken dialog systems. However, these models might not perform adequately in dictation tasks.


Cite as: Sturm, J., Kamperman, H., Boves, L., Os, E.d. (2000) Impact of speaking style and speaking task on acoustic models. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 361-364

@inproceedings{sturm00_icslp,
  author={Janienke Sturm and Hans Kamperman and Lou Boves and Els den Os},
  title={{Impact of speaking style and speaking task on acoustic models}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 361-364}
}