Multimodal alignment between classroom lecture utterances and lecture slide components is one of the crucial problems to realize a multimodal e-Learning application. This paper proposes the new method for the automatic alignment, and formulates the alignment as the integer linear programming (ILP) problem to maximize the score function which consists of three factors: the similarity score between utterances and slide components, the consistency of the explanation order, and the explanation coverage of slide components. The experimental result on the Corpus of Japanese classroom Lecture Contents (CJLC) shows that the automatic alignment information acquired by the proposed method is effective to improve the performance of the automatic extraction of important utterances.
Cite as: Tsuchiya, M., Minamiguchi, R. (2017) Automatic Alignment Between Classroom Lecture Utterances and Slide Components. Proc. Interspeech 2017, 2904-2908, doi: 10.21437/Interspeech.2017-1752
@inproceedings{tsuchiya17_interspeech, author={Masatoshi Tsuchiya and Ryo Minamiguchi}, title={{Automatic Alignment Between Classroom Lecture Utterances and Slide Components}}, year=2017, booktitle={Proc. Interspeech 2017}, pages={2904--2908}, doi={10.21437/Interspeech.2017-1752} }