ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

The broadcast narrow band speech corpus: a new resource type for large scale language recognition

Christopher Cieri, Linda Brandschain, Abby Neely, David Graff, Kevin Walker, Chris Caruso, Alvin F. Martin, Craig S. Greenberg

This paper describes a new resource type, broadcast narrow band speech for use in large scale language recognition research and technology development. After providing the rational for this new resource type, the paper describes the collection, segmentation, auditing procedures and data formats used. Along the way, it addresses issues of defining language and dialect in found data and how ground truth is established for this corpus.


doi: 10.21437/Interspeech.2009-732

Cite as: Cieri, C., Brandschain, L., Neely, A., Graff, D., Walker, K., Caruso, C., Martin, A.F., Greenberg, C.S. (2009) The broadcast narrow band speech corpus: a new resource type for large scale language recognition. Proc. Interspeech 2009, 2867-2870, doi: 10.21437/Interspeech.2009-732

@inproceedings{cieri09_interspeech,
  author={Christopher Cieri and Linda Brandschain and Abby Neely and David Graff and Kevin Walker and Chris Caruso and Alvin F. Martin and Craig S. Greenberg},
  title={{The broadcast narrow band speech corpus: a new resource type for large scale language recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={2867--2870},
  doi={10.21437/Interspeech.2009-732}
}