INTERSPEECH 2004 - ICSLP
This paper presents the Audio-Video Australian English Speech data corpus AVOZES. It contains recordings of 20 speakers uttering a variety of phrases. The corpus was designed for research on the statistical relationship of audio and video speech parameters with an audio-video (AV) automatic speech recognition (ASR) task in mind, but may be useful for other research tasks. AVOZES is the first published AV speaking-face data corpus for Australian English and is novel in its use of a stereo camera system for the video recordings and its modular design.
Bibliographic reference. Millar, J. Bruce / Goecke, Roland (2004): "The audio-video australian English speech data corpus AVOZES", In INTERSPEECH-2004, 2525-2528.