Movie subtitle alignment is a potentially useful approach for deriving automatically parallel bilingual/multilingual spoken language data for automatic speech translation. In this paper, we consider the movie subtitle alignment task. We propose a distance metric between utterances of different languages based on lexical features derived from bilingual dictionaries. We use the dynamic time warping algorithm to obtain the best alignment. The best F-score of ~0.713 is obtained using the proposed approach.
Cite as: Tsiartas, A., Ghosh, P.K., Georgiou, P.G., Narayanan, S.S. (2009) Context-driven automatic bilingual movie subtitle alignment. Proc. Interspeech 2009, 444-447, doi: 10.21437/Interspeech.2009-159
@inproceedings{tsiartas09_interspeech, author={Andreas Tsiartas and Prasanta Kumar Ghosh and Panayiotis G. Georgiou and Shrikanth S. Narayanan}, title={{Context-driven automatic bilingual movie subtitle alignment}}, year=2009, booktitle={Proc. Interspeech 2009}, pages={444--447}, doi={10.21437/Interspeech.2009-159} }