11th Annual Conference of the International Speech Communication Association

Makuhari, Chiba, Japan
September 26-30. 2010

Speech Recognition with a Seamlessly Updated Language Model for Real-Time Closed-Captioning

Toru Imai, Shinichi Homma, Akio Kobayashi, Takahiro Oku, Shoei Sato

NHK Science & Technology Research Laboratories, Japan

It is desirable to consistently and seamlessly update a language model of speech recognition without stopping it for online applications such as real-time closed-captioning. This paper proposes a novel speech recognition system that enables the model to be updated at any time even while it is running. It can run the second decoder with the latest model in parallel, and their priority that must be accessed is controlled at a non-speech portion by an additional job process, which sends acoustic features only to an active target decoder with the latest model and sends recognized words to the backend manual error correction for closed-captioning. The system seamlessly updates the model and ensures endless speech recognition with the latest model at any time. Our new practical real-time closed-captioning system reduced word errors by two thirds with the proposed language model update mechanism in the speech recognition and captioning experiments for Japanese broadcast news programs.

Full Paper

Bibliographic reference.  Imai, Toru / Homma, Shinichi / Kobayashi, Akio / Oku, Takahiro / Sato, Shoei (2010): "Speech recognition with a seamlessly updated language model for real-time closed-captioning", In INTERSPEECH-2010, 262-265.