7th International Conference on Spoken Language Processing

September 16-20, 2002
Denver, Colorado, USA

Japanese Broadcast News Transcription

Long Nguyen, Xuefeng Guo, Richard Schwartz, John Makhoul

BBN Technologies, USA

In this paper, we describe the ongoing development of a Japanese Broadcast News Transcription system at BBN Technologies. This work is a collaboration between BBN and NHK to use automatic speech recognition technology to provide live closed captions for NHK’s TV news programs in Japan. We describe what the NHK Broadcast News Corpus comprises and how we adopted transcription technology developed for the Hub-4 English broadcast news task to achieve an overall word error rate (WER) of less than 5% for Japanese TV news programs. We also report how we obtained a 30-50% relative WER reduction for weather forecasts and sports news by using micro-domain lexicons and language models.
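For readers unfamiliar with the two metrics quoted above, the following is a minimal sketch of how WER and a relative WER reduction are conventionally computed. It is not the scoring tool used in the paper. The function names and the toy inputs are hypothetical, and the sketch assumes the text has already been segmented into word tokens, which for Japanese is a separate step that itself affects the reported WER.

```python
def word_error_rate(reference, hypothesis):
    """WER = (substitutions + deletions + insertions) / number of reference words.

    Both arguments are lists of word tokens (segmentation is assumed done).
    """
    n, m = len(reference), len(hypothesis)
    if n == 0:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the reference prefix seen so far
    # and the first j hypothesis words (standard Levenshtein recurrence).
    prev = list(range(m + 1))
    for i in range(1, n + 1):
        curr = [i] + [0] * m
        for j in range(1, m + 1):
            cost = 0 if reference[i - 1] == hypothesis[j - 1] else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[m] / n


def relative_reduction(baseline_wer, new_wer):
    """Fraction of the baseline error removed, e.g. 0.4 means a 40% relative reduction."""
    return (baseline_wer - new_wer) / baseline_wer
```

A relative reduction is measured against the baseline error rather than in absolute percentage points: with purely illustrative numbers, going from a WER of 10% to 6% is a 4-point absolute drop but a 40% relative reduction.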


Bibliographic reference. Nguyen, Long / Guo, Xuefeng / Schwartz, Richard / Makhoul, John (2002): "Japanese broadcast news transcription", in ICSLP-2002, 1749-1752.