Call My Net Corpus: A Multilingual Corpus for Evaluation of Speaker Recognition Technology

Karen Jones, Stephanie Strassel, Kevin Walker, David Graff, Jonathan Wright


The Call My Net 2015 (CMN15) corpus presents a new resource for Speaker Recognition Evaluation and related technologies. The corpus includes conversational telephone speech recordings for a total of 220 speakers spanning 4 languages: Tagalog, Cantonese, Mandarin and Cebuano. The corpus includes 10 calls per speaker made under a variety of noise conditions. Calls were manually audited for language, speaker identity and overall quality. The resulting data has been used in the NIST 2016 SRE Evaluation and will be published in the Linguistic Data Consortium catalog. We describe the goals of the CMN15 corpus, including details of the collection protocol and auditing procedure and discussion of the unique properties of this corpus compared to prior NIST SRE evaluation corpora.


 DOI: 10.21437/Interspeech.2017-1521

Cite as: Jones, K., Strassel, S., Walker, K., Graff, D., Wright, J. (2017) Call My Net Corpus: A Multilingual Corpus for Evaluation of Speaker Recognition Technology. Proc. Interspeech 2017, 2621-2624, DOI: 10.21437/Interspeech.2017-1521.


@inproceedings{Jones2017,
  author={Karen Jones and Stephanie Strassel and Kevin Walker and David Graff and Jonathan Wright},
  title={Call My Net Corpus: A Multilingual Corpus for Evaluation of Speaker Recognition Technology},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={2621--2624},
  doi={10.21437/Interspeech.2017-1521},
  url={http://dx.doi.org/10.21437/Interspeech.2017-1521}
}