HomeBank: A Repository for Long-Form Real-World Audio Recordings of Children

Anne S. Warlaumont, Mark VanDam, Elika Bergelson, Alejandrina Cristia


HomeBank is a new component of the TalkBank system, focused on long-form (i.e., multi-hour, typically daylong) real-world recordings of children’s language experiences, and it is linked to a GitHub repository in which tools for analyzing those recordings can be shared. HomeBank constitutes not only a rich resource for researchers interested in early language acquisition specifically, but also for those seeking to study spontaneous speech, media exposure, and audio environments more generally. This Show and Tell describes the procedures for accessing and contributing HomeBank data and code. It also overviews the current contents of the repositories, and provides some examples of audio recordings, available transcriptions, and currently available analysis tools.


Cite as: Warlaumont, A.S., VanDam, M., Bergelson, E., Cristia, A. (2017) HomeBank: A Repository for Long-Form Real-World Audio Recordings of Children. Proc. Interspeech 2017, 815-816.


@inproceedings{Warlaumont2017,
  author={Anne S. Warlaumont and Mark VanDam and Elika Bergelson and Alejandrina Cristia},
  title={HomeBank: A Repository for Long-Form Real-World Audio Recordings of Children},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={815--816}
}