ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

SVitchboard 1: small vocabulary tasks from Switchboard

Simon King, Chris Bartels, Jeff Bilmes

We present a conversational telephone speech data set designed to support research on novel acoustic models. Small vocabulary tasks from 10 words up to 500 words are defined using subsets of the Switchboard-1 corpus; each task has a completely closed vocabulary (an OOV rate of 0%). We justify the need for these tasks, describe the algorithm for selecting them from a large corpus, give a statistical analysis of the data and present baseline whole-word hidden Markov model recognition results. The goal of the paper is to define a common data set and to encourage other researchers to use it.


doi: 10.21437/Interspeech.2005-869

Cite as: King, S., Bartels, C., Bilmes, J. (2005) SVitchboard 1: small vocabulary tasks from Switchboard. Proc. Interspeech 2005, 3385-3388, doi: 10.21437/Interspeech.2005-869

@inproceedings{king05_interspeech,
  author={Simon King and Chris Bartels and Jeff Bilmes},
  title={{SVitchboard 1: small vocabulary tasks from Switchboard}},
  year=2005,
  booktitle={Proc. Interspeech 2005},
  pages={3385--3388},
  doi={10.21437/Interspeech.2005-869}
}