ISCA Archive Interspeech 2005

The BBN RT04 English broadcast news transcription system

Long Nguyen, Bing Xiang, Mohamed Afify, Sherif Abdou, Spyros Matsoukas, Richard Schwartz, John Makhoul

This paper describes the BBN English Broadcast News (BN) transcription system developed for the EARS Rich Transcription 2004 (RT04) evaluation. Compared with the BBN RT03 system, we achieved a relative reduction in word error rate of around 22% on all EARS BN development test sets. The largest contribution to this improvement came from additional acoustic training data acquired through Light Supervision applied to thousands of hours of found data. Better audio segmentation, obtained by using an online speaker clustering algorithm and by chopping speaker turns into moderately long utterances, also contributed substantially. Other contributions, each modest in size but adding up in combination, include discriminative training for all acoustic models, word duration as an additional knowledge source during N-best rescoring, and an updated lexicon and language models.
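
The headline figure is a relative, not absolute, reduction in word error rate (WER). As a minimal sketch of how such a figure is conventionally computed, the snippet below divides the absolute WER drop by the baseline WER. The function name and the two WER values are hypothetical and chosen only for illustration; they are not results reported in the paper.

```python
def relative_wer_reduction(baseline_wer: float, new_wer: float) -> float:
    """Return the relative WER reduction as a fraction of the baseline WER."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


# Hypothetical WER values in percent, for illustration only;
# these are NOT numbers taken from the paper.
hypothetical_baseline_wer = 10.0
hypothetical_new_wer = 7.8

reduction = relative_wer_reduction(hypothetical_baseline_wer, hypothetical_new_wer)
print(f"Relative WER reduction: {reduction:.1%}")
```

Under these assumed values, the absolute drop is 2.2 WER points while the relative reduction is about 22%, which is why the two kinds of figure should not be confused.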

doi: 10.21437/Interspeech.2005-546

Cite as: Nguyen, L., Xiang, B., Afify, M., Abdou, S., Matsoukas, S., Schwartz, R., Makhoul, J. (2005) The BBN RT04 English broadcast news transcription system. Proc. Interspeech 2005, 1673-1676, doi: 10.21437/Interspeech.2005-546

  author={Long Nguyen and Bing Xiang and Mohamed Afify and Sherif Abdou and Spyros Matsoukas and Richard Schwartz and John Makhoul},
  title={{The BBN RT04 English broadcast news transcription system}},
  booktitle={Proc. Interspeech 2005},