INTERSPEECH 2004 - ICSLP
8th International Conference on Spoken Language Processing

Jeju Island, Korea
October 4-8, 2004

The Audio-Video Australian English Speech Data Corpus AVOZES

J. Bruce Millar (1), Roland Goecke (2)

(1) Australian National University, Australia
(2) Autonomous Systems & Sensing Technologies, Australia

This paper presents the Audio-Video Australian English Speech data corpus AVOZES. It contains recordings of 20 speakers uttering a variety of phrases. The corpus was designed for research on the statistical relationship of audio and video speech parameters with an audio-video (AV) automatic speech recognition (ASR) task in mind, but may be useful for other research tasks. AVOZES is the first published AV speaking-face data corpus for Australian English and is novel in its use of a stereo camera system for the video recordings and its modular design.

Full Paper

Bibliographic reference.  Millar, J. Bruce / Goecke, Roland (2004): "The audio-video australian English speech data corpus AVOZES", In INTERSPEECH-2004, 2525-2528.