ISCA Archive AVSP 2003

Enhanced auditory detection with AV speech: perceptual evidence for speech and non-speech mechanisms

Lynne E. Bernstein, Sumiko Takayanagi, Edward T. Auer, Jr.

Speech in a noisy or reverberant environment is more detectable and more intelligible when the listener can see the talker. Explaining these perceptual phenomena is a fundamental problem for AV speech research. We have undertaken a series of behavioral and electrophysiological experiments to investigate the perceptual and neural bases of enhanced auditory speech detection in noise with AV stimuli. We hypothesize that the enhancement effect arises from at least two neurophysiologically distinct mechanisms, one that is in no way specialized for speech and another that is specific to speech stimuli. Here we report the results of a perceptual experiment in which an auditory /ba/ token was presented adaptively in white noise to obtain its 71% detection threshold [1]. Participants were tested in three conditions: auditory-only speech, audiovisual speech, and auditory speech with a dynamic visual Lissajous figure. The Lissajous figure served as a control for many of the complex visual features of speech. Evidence was obtained for two separate sources of AV detection enhancement: detection thresholds were highest for auditory-only speech, lower for auditory speech with the Lissajous figure, and lowest for audiovisual speech. The Discussion section outlines the implications and limitations of these results for explaining the AV speech detection enhancement effect.
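
The abstract states only that the /ba/ token was presented adaptively to find its 71% detection threshold; it does not say which adaptive rule was used. One common rule that converges near that point is the two-down, one-up staircase, which settles where the probability of two consecutive correct responses equals 0.5, that is, at about 70.7% correct. The Python sketch below illustrates that rule under stated assumptions and is not the authors' procedure: the simulated observer, its logistic psychometric function, the two-alternative guessing rate, the step size, and the reversal counts are all hypothetical.

```python
import math
import random


def p_correct(level_db, midpoint_db=-20.0, slope=0.5):
    """Hypothetical observer: chance (0.5) at low levels, 1.0 at high levels."""
    guess = 0.5
    return guess + (1 - guess) / (1 + math.exp(-slope * (level_db - midpoint_db)))


def target_level(p_target=math.sqrt(0.5), midpoint_db=-20.0, slope=0.5):
    """Level at which the hypothetical observer reaches p_target (about 70.7%)."""
    q = (p_target - 0.5) / 0.5
    return midpoint_db + math.log(q / (1 - q)) / slope


def run_staircase(rng, start_db=0.0, step_db=2.0, n_reversals=12, n_discard=4):
    """Two-down, one-up staircase; returns the mean of the late reversal levels."""
    level = start_db
    correct_in_a_row = 0
    last_direction = 0  # -1 = level went down, +1 = went up, 0 = no step yet
    reversals = []
    while len(reversals) < n_reversals:
        correct = rng.random() < p_correct(level)
        if correct:
            correct_in_a_row += 1
            if correct_in_a_row < 2:
                continue  # one correct response alone does not change the level
            correct_in_a_row = 0
            direction = -1  # two correct in a row: make the task harder
        else:
            correct_in_a_row = 0
            direction = +1  # any error: make the task easier
        if last_direction != 0 and direction != last_direction:
            reversals.append(level)  # the track changed direction at this level
        last_direction = direction
        level += direction * step_db
    kept = reversals[n_discard:]  # drop early reversals from the initial descent
    return sum(kept) / len(kept)


if __name__ == "__main__":
    rng = random.Random(0)
    estimates = [run_staircase(rng) for _ in range(200)]
    mean_estimate = sum(estimates) / len(estimates)
    print(f"target level:  {target_level():.2f} dB")
    print(f"mean estimate: {mean_estimate:.2f} dB")
```

In an experiment of the kind described, a track like this would be run separately in each condition (auditory-only, audiovisual, and auditory with the Lissajous figure), and a lower estimated threshold would indicate better detection.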


Cite as: Bernstein, L.E., Takayanagi, S., Auer, E.T., Jr. (2003) Enhanced auditory detection with av speech: perceptual evidence for speech and non-speech mechanisms. Proc. Auditory-Visual Speech Processing, 13-17.

@inproceedings{bernstein03_avsp,
  author={Lynne E. Bernstein and Sumiko Takayanagi and Auer, Jr., Edward T.},
  title={{Enhanced auditory detection with av speech: perceptual evidence for speech and non-speech mechanisms}},
  year=2003,
  booktitle={Proc. Auditory-Visual Speech Processing},
  pages={13--17}
}