INTERSPEECH 2014
15th Annual Conference of the International Speech Communication Association

Singapore
September 14-18, 2014

Automated Closed Captioning for Russian Live Broadcasting

K. Levin (1), I. Ponomareva (1), A. Bulusheva (1), G. Chernykh (2), I. Medennikov (1), N. Merkin (1), A. Prudnikov (1), Natalia Tomashenko (1)

(1) Speech Technology Center, Russia
(2) Saint-Petersburg State University, Russia

The paper describes a hardware-software system for real-time closed captioning of Russian live TV broadcasts. Using respeaking technology, we built an ASR system whose WER does not exceed 5.5%. Editing the closed captions in real time further reduces WER to 0.2%. We report advances in language models (LMs) for a highly inflected language and in morphological rescoring of the decoder word lattice. We also propose a solution to the punctuation problem and effective methods for real-time editing of ASR results. The system was used successfully during the Paralympic Games in Sochi for live web broadcasting on russiasport.ru. This is work in progress, and we plan to improve ASR accuracy further over the next year.
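For readers unfamiliar with the metric, the WER figures quoted in the abstract are conventionally computed as the word-level edit distance (substitutions, deletions and insertions) between the recognized text and a reference transcript, divided by the number of reference words. The sketch below illustrates that standard definition only. The function name `word_error_rate` and the sample sentences are assumptions made for illustration, and the abstract does not state which scoring tool or text normalization the authors used.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative only: whitespace tokenization, no case or punctuation
    normalization (a real evaluation would define these explicitly).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # aligning i reference words with an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical caption pair, not taken from the paper.
    reference = "the games open today in sochi"
    hypothesis = "the game opens today in sochi"
    print(f"WER = {word_error_rate(reference, hypothesis):.1%}")
```

Under this definition, a WER of 5.5% corresponds to roughly 55 word errors per 1000 reference words, and 0.2% to about 2 per 1000.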

Bibliographic reference. Levin, K. / Ponomareva, I. / Bulusheva, A. / Chernykh, G. / Medennikov, I. / Merkin, N. / Prudnikov, A. / Tomashenko, Natalia (2014): "Automated closed captioning for Russian live broadcasting", in INTERSPEECH-2014, 1438-1442.