ISCA Archive Interspeech 2008

Discriminative rescoring based on minimization of word errors for transcribing broadcast news

Akio Kobayashi, Takahiro Oku, Shinichi Homma, Shoei Sato, Toru Imai, Tohru Takagi

This paper describes a novel rescoring method that reflects the error tendencies of word hypotheses in speech recognition for transcribing broadcast news, including spontaneous speech, for which the models are poorly trained. The proposed rescoring assigns penalties to sentence hypotheses according to the recognition error tendencies observed in the training lattices themselves, using a set of weighting factors for feature functions that are activated by a variety of linguistic contexts. The weighting factors penalize word hypotheses that are unlikely to be correct and reward those that are likely to be correct. We introduce two training techniques for obtaining the factors. The first is based on conditional random fields (CRFs); the second is based on the minimization of word errors, which explicitly reduces the expected number of word errors. In experiments on transcribing Japanese broadcast news, the method achieved a word error rate (WER) of 10.38%, a 6.06% relative reduction compared with conventional lattice rescoring.
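The rescoring idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes an N-best list in place of the lattices used in the paper, uses hypothetical unigram and bigram indicator features as the "linguistic contexts", and all names (`features`, `rescore`, `expected_word_errors`) and the toy scores are invented for illustration. Each hypothesis score is its baseline log score plus the weighted sum of its active features, and the expected word error is the posterior-weighted edit distance to the reference, which is the kind of quantity a minimum-word-error criterion would try to reduce.

```python
import math


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two word lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1]


def features(words):
    """Hypothetical feature functions: unigram and bigram counts."""
    feats = {}
    for w in words:
        feats[("uni", w)] = feats.get(("uni", w), 0) + 1
    for a, b in zip(words, words[1:]):
        feats[("bi", a, b)] = feats.get(("bi", a, b), 0) + 1
    return feats


def rescore(hyps, weights):
    """Baseline log score plus weighted feature sum for each hypothesis."""
    return [base + sum(weights.get(f, 0.0) * c
                       for f, c in features(words).items())
            for words, base in hyps]


def posteriors(scores):
    """Softmax over hypothesis scores (numerically stabilized)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def expected_word_errors(hyps, weights, ref):
    """Posterior-weighted word errors over the N-best list."""
    post = posteriors(rescore(hyps, weights))
    return sum(p * edit_distance(ref, words)
               for p, (words, _) in zip(post, hyps))


# Toy data (assumed): the baseline prefers the erroneous hypothesis.
ref = ["the", "cat", "sat"]
hyps = [(["the", "cat", "sat"], -10.0),
        (["the", "cat", "sad"], -9.5)]

no_weights = {}
# A negative weight acts as a penalty on a word that tends to be wrong.
penalized = {("uni", "sad"): -2.0}

before = expected_word_errors(hyps, no_weights, ref)
after = expected_word_errors(hyps, penalized, ref)
best_before = max(range(len(hyps)), key=lambda i: rescore(hyps, no_weights)[i])
best_after = max(range(len(hyps)), key=lambda i: rescore(hyps, penalized)[i])
```

In this toy case the penalty on the hypothetical feature `("uni", "sad")` moves the top-ranked hypothesis from the erroneous one to the correct one and lowers the expected word error. In a trained system the weights would be estimated from data rather than set by hand.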


doi: 10.21437/Interspeech.2008-260

Cite as: Kobayashi, A., Oku, T., Homma, S., Sato, S., Imai, T., Takagi, T. (2008) Discriminative rescoring based on minimization of word errors for transcribing broadcast news. Proc. Interspeech 2008, 1574-1577, doi: 10.21437/Interspeech.2008-260

@inproceedings{kobayashi08_interspeech,
  author={Akio Kobayashi and Takahiro Oku and Shinichi Homma and Shoei Sato and Toru Imai and Tohru Takagi},
  title={{Discriminative rescoring based on minimization of word errors for transcribing broadcast news}},
  year=2008,
  booktitle={Proc. Interspeech 2008},
  pages={1574--1577},
  doi={10.21437/Interspeech.2008-260}
}