ISCA Archive Interspeech 2007

Construction of spoken language model including fillers using filler prediction model

Kengo Ohta, Masatoshi Tsuchiya, Seiichi Nakagawa

This paper proposes a novel method for constructing a spoken language model that includes fillers from a corpus that contains no fillers, using a filler prediction model. The filler prediction model consists of two submodels: a filler insertion model, which predicts the positions where fillers should be inserted, and a filler selection model, which predicts an appropriate filler for each of those positions. The method converts a corpus that covers domain-relevant topics but contains no fillers into a corpus that contains fillers while still covering those topics. An experiment on the Corpus of Spontaneous Japanese shows that language models constructed by the proposed method achieve performance quite close to that of a traditional trigram language model built from a real spontaneous speech corpus that includes fillers.
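
The two-stage idea described above can be illustrated with a minimal Python sketch. The abstract does not specify how the insertion and selection models are parameterized, so this sketch assumes the simplest possible form: relative-frequency estimates conditioned only on the preceding word. The filler inventory, the toy English sentences, the 0.5 threshold, and the function names `train` and `insert_fillers` are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict

FILLERS = {"uh", "um"}  # hypothetical filler inventory for this toy example
BOS = "<s>"             # sentence-start symbol used as the first context


def train(annotated_sentences):
    """Collect counts from a small corpus that does contain fillers."""
    gaps = Counter()                # times a context word precedes a gap
    inserts = Counter()             # times that gap is occupied by a filler
    choices = defaultdict(Counter)  # which filler follows each context word
    for sentence in annotated_sentences:
        prev, prev_was_filler = BOS, False
        for token in sentence:
            if token in FILLERS:
                gaps[prev] += 1
                inserts[prev] += 1
                choices[prev][token] += 1
                prev_was_filler = True
            else:
                if not prev_was_filler:
                    gaps[prev] += 1  # a gap that stayed empty
                prev, prev_was_filler = token, False
    return gaps, inserts, choices


def insert_fillers(sentence, model, threshold=0.5):
    """Convert a filler-free sentence into one with predicted fillers."""
    gaps, inserts, choices = model
    output, prev = [], BOS
    for token in sentence:
        # Insertion step: is a filler likely after the preceding word?
        if inserts[prev] and inserts[prev] / gaps[prev] >= threshold:
            # Selection step: choose the most frequent filler in this context.
            output.append(choices[prev].most_common(1)[0][0])
        output.append(token)
        prev = token
    return output


annotated = [
    ["um", "i", "think", "uh", "this", "works"],
    ["um", "we", "think", "uh", "that", "helps"],
    ["i", "think", "this", "helps"],
]
model = train(annotated)
converted = insert_fillers(["we", "think", "this", "works"], model)
print(" ".join(converted))
```

Applying `insert_fillers` to every sentence of a filler-free domain corpus would yield a filler-containing corpus from which an ordinary n-gram language model could then be trained. The decision here is deterministic to keep the example easy to follow; sampling from the estimated probabilities would be an equally plausible design.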


doi: 10.21437/Interspeech.2007-431

Cite as: Ohta, K., Tsuchiya, M., Nakagawa, S. (2007) Construction of spoken language model including fillers using filler prediction model. Proc. Interspeech 2007, 1489-1492, doi: 10.21437/Interspeech.2007-431

@inproceedings{ohta07_interspeech,
  author={Kengo Ohta and Masatoshi Tsuchiya and Seiichi Nakagawa},
  title={{Construction of spoken language model including fillers using filler prediction model}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={1489--1492},
  doi={10.21437/Interspeech.2007-431}
}