• Skip to content
  • Jump to main navigation and login

Nav view search

Navigation

  • Contact Us
  • Login
    • ISCA Members
    • ISCA Board
    • Conference Organisers
    • Jobs Administration
    • Workshops Administration
  • Archive
    • ISCApad Archive
    • Online Archive
    • Video Archive
    • Proceedings Order
  • Site Map
  • Home
  • About ISCA
  • Membership
  • Online Archive
  • Newsroom
  • ISCApad
  • SIGs
  • Conferences
  • Workshops
  • Call for Papers
  • Publications
  • Training
  • SCOOT
  • Honors
  • Grants
  • Distinguished Lecturers
  • Outreach Programs
  • Diversity
  • Liaison
  • Student Section
  • Jobs

ISCApad

ISCApad Archive  »  2016  »  ISCApad #219  »  Resources  »  Database  »  Annotation of “Hannah and her sisters” by Woody Allen.

ISCApad #219

Friday, September 23, 2016 by Chris Wellekens

5-2-8 Annotation of “Hannah and her sisters” by Woody Allen.
  

We have created and made publicly available a dense audio-visual person-oriented ground-truth annotation of a feature movie (100 minutes long): “Hannah and her sisters” by Woody Allen.

The annotation includes

•          Face tracks in video (densely annotated, i.e., in each frame, and person-labeled)

•             Speech segments in audio (person-labeled)

•             Shot boundaries in video



The annotation can be useful for evaluating



•   Person-oriented video-based tasks (e.g., face tracking, automatic character naming, etc.)

•             Person-oriented audio-based tasks (e.g., speaker diarization or recognition)

•             Person-oriented multimodal-based tasks (e.g., audio-visual character naming)



Detail on Hannah dataset and access to it can be obtained there:

https://research.technicolor.com/rennes/hannah-home/

https://research.technicolor.com/rennes/hannah-download/



Acknowledgments:

This work is supported by AXES EU project: http://www.axes-project.eu/










Alexey Ozerov Alexey.Ozerov@technicolor.com

Jean-Ronan Vigouroux,

Louis Chevallier

Patrick Pérez

Technicolor Research & Innovation



 


Top

Copyright @ 2009 - 2013 International Speech Communication Association    -